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Picturephone® Silicon Target 
Signal Analysis 


By W. E. BEADLE and A. J. SCHORR 
(Manuscript received January 5, 1970) 


Signal characteristics of the silicon diode array used as the image sensing 
element of the Picturephone® camera tube have been studied using a 
numerical computation. Analytical representation of the target is based on 
the numerical calculation of the depletion region geometry for an array 
diode unit-cell undergoing discharge. This analysis includes the effects of 
Si-S10, fixed interface charge density, sea resistance, substrate resistivity, 
P* island geometry, and SiO.-Si surface inversion phenomenon. 

Computed depletion region geometries are used to calculate surface and 
bulk contributions to array dark current. It is shown that signal current 
limitations due to surface inversion can be avoided using lower values of 
sea resistance coupled with higher Si0.-Si interface fixed charge density. 
For the resistive sea target structure, inversion effects are less pronounced 
for targets fabricated with P* islands larger than 10 ym diameter and sea 
sheet resistances less than 10"* ohms per square. This signal limiting effect 
can also be eliminated by using a conductive overlay structure. 

Results of analyses of lag characteristics and electron beam limitations 
are also presented. 


I. INTRODUCTION 


The silicon target, used as the image sensing element in the Picture- 
phone® camera tube,’ is composed of a matrix of diffused diodes. 
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Signal characteristics of the target are the result of a complex interaction 
of the material and geometric parameters of the individual array diodes 
as well as the operating conditions of the target. We have analtyically 
investigated target signal characteristics by means of numerical com- 
putations, solving explicitly for depletion region geometry and capaci- 
tance and have determined signal characteristics as a function of 
Si-SiO, interface fixed charge density, sea resistance*, substrate resis- 
tivity, P* island geometry, SiO.-Si surface inversion phenomenon, and 
electron beam acceptance characteristics. 

Section IT describes the diode array format and the basic relationship 
of signal current to array capacitance. The model of the reversed biased 
unit cell and the mathematic analysis are described in Sections II and 
III. Signal capabilities of a resistive sea diode array structure are 
discussed in Section IV and extended to include the effects of electron 
beam acceptance in Section V. An alternate geometric structure using 
conductive overlays is analyzed in Section VI. The relationship between 
the array dark current characteristic and target parameters is presented 
in Section VII. In the concluding section we present experimental 
verification of the model. 


Il. GENERAL TARGET OPERATION 


The Picturephone® camera tube target is an array of 8 X 10° 
planar P*N diodes fabricated on a thin N-type silicon membrane, 
(Fig. 1). The optical sensing mechanism of the target is the discharge 
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DIODE ARRAY ~ 
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Fig. 1—Silicon camera tube target. 


* We use the term ‘‘sea”’ to describe the resistive layer covering the array surface 
containing the P* diffused regions which we will refer to as P+ islands. 
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of this array of diodes by light incident on the N* surface (side opposite 
the array). To review the details of the operation it is convenient to 
consider an individual array element, or unit cell, represented by a 
15um square section of the array containing a centrally located diode. 

The back surface of the target is held at a potential V;. The target 
face potential is periodically reduced to ground potential by the scanning 
electron beam (Fig. 2). For ideal beam acceptance, the target unit cell 
(consisting of the junction plus the oxide capacitance) is reverse biased 
by an amount V7 immediately after scanning. At this time, the junction 
depletion region and any existing silicon surface depletion region will 
be at their geometric maximum. During the time interval (corresponding 
to a frame time, rr), that the electron beam is scanning the remainder 
of the target, the diode cell is discharged by the photon generated holes 
which diffuse to the depletion region of the device. 

The relationship between light-generated minority carriers available 
for discharge and the incident photon flux and the subsequent diffusion 
of these carriers to the diode depletion region has been discussed by 
others.* The individual diode steady state signal current can be related 
to the diode discharge by 


1 Vr 
= C(V) dV (1) 
Tr UVe 
where 
C(V) = diode cell capacitance, 
V7 = the voltage across the diode cell at the start of a frame, 
Vy, = the voltage across the diode cell at the completion of a frame. 


The upper integration limit will be less than V7 if the tube has 
capacitive lag. Maximum signal occurs when Vy = 0, which corresponds 


——RESISTIVE SEA 









Vy 


ey Yi a eee 


yy a 

Lic (KMY); teat ba >3 
SEL HOR] + tN 3 —___ 

g Uy ii oS ae 
Nt—7 BZ Trove + Sek eS 

g Yy an es ELECTRON BEAM 

g oT SS LS 

4 os a 
+ + 

Z 

g 


~~~SiOs (.85p THICK ) 


\ 
~-DEPLETION REGION 


Fig. 2—Target in scanning mode. 
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to ideal electron beam acceptance. Additional charge storage can be 
obtained by driving the diode cell into forward bias. Since increased 
lateral hole spreading causes loss of resolution under this condition, we 
restrict our analysis to reverse bias conditions. 

Figure 3 schematically shows the equivalent components associated 
with the unit cell structure. These consist of: 


(t) the diode junction capacitance, C, , 
(iz) the oxide capacitance, Cy, , and 
(22t) the silicon surface capacitance at the silicon-oxide interface, Cp zp. 


2.1 Mathematical Model of a Reversed Biased Diode Unit Cell 


For purposes of analysis, an individual diode can be considered as an 
axisymmetric structure composed of several mathematically distinct 
regions. Figure 4 is a schematic representation of the analytical model. 
The regions of interest include: 


(t) Oxide mask— The oxide region is considered to be a homogeneous, 
charge free region with dielectric constant ¢, . The potential function in 
this region is determined by solution of Laplace’s equation. 

(it) P* diffused island—This region is considered to be at a uniform 
potential. 
(iit) N-type bulk region—That portion of the N-type bulk region, 
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Fig. 3—Target discharge model. 
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Fig. 4—Diode unit cell geometry. 


remote from the depletion region, is considered to be at uniform poten- 
tial. 

(wv) Depletion region—The potential distribution in the depletion 
region is described by Poisson’s equation. The charge distribution in 
the depletion region remote from the oxide-silicon interface is considered 
constant and set by the doping level of the N-type bulk material. An 
additional charge density is assumed in the depletion region at the 
Si-Si0, interface to describe the oxide fixed charge. The depletion region 
has a dielectric constant €> . 


2.2 Boundary, Interface Conditions 


The target analysis is complicated because the target consists of 
axisymmetric diffused regions arranged on a square format and spaced 
such that the diode depletion regions can connect for some combinations 
of target parameters. We analyze an axisymmetric structure and correct 
for the effects of the cell corner regions when necessary. 

Except for the boundary separating the depletion and N-type bulk 
regions (discussed in Section 2.3), the boundary conditions for this 
problem are straight-forward: 


(2) There is a radial potential distribution on target surface. 
(zz) The radial potential gradients along the axis of symmetry are 
zero. 
(iz) The condition of conservation of charge flux is applied at the 
oxide-depletion region interface. 
(wv) Radial potential gradients at the diode periphery are zero. 
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2.3 Method of Solution 


Regions of dissimilar material constants and geometrically compli- 
cated boundaries make solution by ‘‘closed form” techniques difficult. 
Tor this reason and because of its inherent geometric flexibility, a 
numerical or finite difference solution was adopted. 

The entire diode structure is considered to be composed of differential 
elements spaced on a square mesh (Fig. 5). The potential of each ele- 
ment is described by an appropriate difference equation which takes 
into account the material constants of the element and its interaction 
with adjacent elements. For example, the potential of an element in the 
depletion region at a location (J, J) is given by a difference equation 
of the type: 


| Ue JAN VG, Vw = Dee VG, D| 











AZ AZ 
VU =1,))-— VED] E +1,/)— VU, | 
a A] AR tA, AR 
= —¢q AZA,N(I, J) (2) 
where 
VU, J) = potential of point (/, J/), 


A,,A.,A_ = areas of element faces, 
e = dielectric constant, 
AR = radial element spacing, 
AZ = axial element spacing, 
qN (I, J) = charge density. 


Similar difference equations can be written to describe the behavior of 
other regions. 

Thus the potential of any element can be described by a linear 
algebraic equation. If the geometry of interest is described by decompos- 
ing the structure into an Mf by N array of such elements, then the 
problem is reduced to the solution of an M X N set of linear equations. 
For a typical geometry considered in this study, the structure was 
represented by a 70 by 85 array of elements. This set of equations was 
solved by accelerated Gauss-Seidel iteration.” 

An important aspect of the solution of this problem is the specification 
of the depletion region boundary. The P*-depletion layer boundary is 
defined by the doping profile. We assume a P* step junction. The 
depletion region-N-type bulk boundary is a function of impurity dis- 
tribution, oxide-silicon interface charge density, and resistive sea surface 
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Fig. 5—Difference elements. 


potential and must therefore be computed for each new combination of 
diode parameters. An iterative procedure was used to calculate potential 
and to define the limits of the depletion region. Beginning at the edge 
of the P* diffused island an iterative sweep through the potential field 
was made. At those points where the computed potential was less than 
the applied potential plus the built-in potential of the junction, Vz , 
Poisson’s equation in difference form was applied. When a potential 
equal to or greater than V7 + V, was computed, the potential at that 
point was set equal to V7 + Vz. This procedure was repeated until a 
satisfactory solution was obtained. 


2.4 Capacitance Calculation 


Depletion region capacitance was calculated by treating the depletion 
boundaries as the surfaces of a two-plate capacitor. The potential field 
between the plates of such a capacitor was calculated by solving Laplace’s 
equation, taking into account the different dielectric constants for the 
oxide and silicon regions. Charge density on the surface is given by 


Q = eli,, (3) 


where /, is the normal electric field component calculated using dif- 
ference approximations for the first derivative of the potential, and ¢ is 
the appropriate dielectric constant. 

Total diode capacitance is then equal to the integral of the local 
charge density divided by the local voltage over the total cell surface 
area. 
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Ill. MATHEMATICAL MODEL OF IDEAL CELL DISCHARGE 


The model considered in the analysis of signal capability is shown 
schematically in Fig. 3. For all subsequent calculations the target is 
biased at a 12-volt potential relative to the cathode (except as noted). 
Both cathode potential drop and beam limitations (including the effect 
of the resistive sea impedance in series with the P* island) are neglected 
in this portion of the analysis. These assumptions imply sufficient beam 
current to completely restore the target surface to cathode potential 
after each beam scan, re-establishing the initial 12-volt reverse bias 
across the junction. (With our convention reverse bias and surface 
potential sum to 12 volts.) During the time interval during which a 
diode is disconnected from the electron beam, rr, an array diode is 
discharged by the light generated hole current collected by its depletion 
region. As the unit cell is discharged, the P* island potential increases 
approaching 12 volts for high illumination levels. The voltage profile on 
the target surface depends on substrate resistivity, SiO.-Si interface 
fixed charge density, target geometry, illumination level, resistive sea 
sheet resistance and electron beam acceptance characteristic. For the 
case of a very high resistance sea—that is, no lateral charge flow onto 
the sea—the potential of the resistive sea surface remote from the diode 
windows will remain at ground potential and the light generated current 
will act to discharge the P*N junction only. Conversely, for the case of 
a low sheet resistance sea the entire resistive sea surface potential will 
rise nearly uniformly. 


3.1 Qualitative Description of Model 


The time dependent solution of the diode cell discharge transient is 
a complicated non-linear problem. However, a reasonable model for 
this problem should include the voltage dependence of both the junction 
and §$i0,-Si surface capacitances. The silicon surface capacitance is 
important since it acts in series with the oxide capacitance to affect the 
time constant of the resistive sea. For example, in the depleted mode the 
equivalent oxide surface capacitance can be half the oxide capacitance. 

The lateral voltage drop on the diode unit cell surface makes possible 
the formation of an inversion layer at the SiO.—Si interface. Since this 
effect can (for some combination of target parameters) establish the 
upper limit on sea resistance, the model should include inversion 
phenomenon. Strong inversion was assumed when the magnitude of the 
surface potential relative to the bulk semiconductor potential exceeds 
the junction reverse bias, Vz , by twice the magnitude of the bulk Fermi 
potential, dp hy 
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A quasi steady-state solution was used in which the discharge time 
interval was divided into sub-intervals over which the surface capaci- 
tance was assumed time invariant. The dependence of the junction 
capacitance on voltage was explicitly described. 

An outline of the unit diode cell discharge analysis is shown in Fig. 6. 
For computational] purposes, rr was divided into several equal segments. 
The computational sequence used during each time segment was as 
follows: 


(z) At the beginning of each segment: First, the unit cell depletion 
geometry and corresponding surface capacitance were calculated subject 
to boundary conditions which include the resistive sea surface potential 
distribution from the previous time step. Second, the surface potential 
profile was recalculated to adjust for altered surface capacitance by 
imposing temporal continuity of surface charge. 

(72) Having obtained the surface potential and capacitance profiles, 
a transient solution to the array discharge process for the time segment 
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Fig. 6—Unit cell discharge analyses. 
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was performed with the surface capacitance profile held constant. The 
voltage dependence of the junction capacitance was accounted for. 
The time dependent surface potential profile and sea current were then 
calculated. The final potential profile was used in the first part of step 7 
of the next time interval. 


3.2 Some Particular Details of Target Operation 


Target operation can be most easily characterized by considering 
operation either below or above total depletion of the silicon surface at 
the Si-SiO, interface. When operating below total surface depletion, 
variations in sea sheet resistance can alter only the width of the depletion 
layer at the surface. Above the condition of total surface depletion, the 
resistive sea sheet resistance can control the formation of a silicon 
surface inversion layer. It should be noted that because of geometrical 
effects, total surface depletion may occur in the diode array at voltages 
significantly below the depletion voltage which would be obtained on 
a MOS capacitor with the same oxide thickness and interface properties. 


3.2.1 Target Operation Below Total Surface Depletion 


Consider a 10 Qcm substrate target with an interface fixed charge 
density (Q,,/q) of 6 X 10°'/em’ and V, = 12 volts. Under these condi- 
tions the target will be below total surface depletion. Figures 7 and 8 
illustrate the behavior of the depletion region and resistive sea surface 
potential at various intervals during discharge for two values of sea 
resistance. 

Comparison of the depletion layer geometries at t = rp (corresponding 
to full discharge) shows that the signal obtained from the junction-only 
capacitance is not appreciably altered by differences in sea sheet resist- 
ance. The surface potential profiles at t = ry clearly indicate the dif- 
ference in surface charge flow for the two values of sheet resistance. An 
effective frame time of 1/60 second is assumed.* 

Figures 9 and 10 show the corresponding sea current and junction 
potential variations over one frame time for the same two sea resistance 
values. In both cases the junction voltage reaches the 12 volt bias 
potential prior to the completion of the frame time. Therefore, the light 
discharge current specified in the calculations 6.4 pA was in excess of 
true saturation, and lateral hole diffusion in the bulk occurs during the 
latter portion of the discharge cycle. For example, for the 10** Q/O 
resistive sea (Figs. 8 and 10), 3.1 pA of the 6.4 pA light signal corre- 


* The system frame time is 1/30 second; however, due to. the interlace scan 
system and large beam diameter, the diodes are charged at a 1/60 second rate. 
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Fig. 7—Calculated discharge conditions for low sea-sheet resistance. 


sponds to the discharge of the junction-only capacitance. The time- 
averaged sea current is 1 pA. The lateral loss due to diffusion is 2.8 pA. 
By decreasing the sca resistance to 10° 2/0 (Figs. 7 and 9) the average 
sea current is increased to about 2 pA-giving a total average cell dis- 
charge current of 5.1 pA. 


3.2.2 Target Operation Above Total Surface Depletion 


For target operation at bias voltages where the surface can be totally 
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Fig. 8—Calculated discharge conditions for high sea-sheet resistance. 


depleted, the sea sheet resistance is more critical. Sea sheet resistance 
not only controls the coupling of the oxide and junction capacities but 
can allow the formation of an inversion layer prior to total cell discharge. 
This effect can result in an additional limitation on maximum useful 
signal current. For example, consider the final depeltion region con- 
figuration (Fig. 11) for a substrate resistivity (p) equal to 10 Qem, 
Si-SiO, interface fixed charge density (Q,./q) equal to 3 X 10°/cm’, 
and a sea sheet resistance (R,) equal to 10* 2/0. Because of the lower 
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Fig. 9—Calculated discharge characteristics for low sea-sheet resistance. 


Q../q, the depletion region is markedly different from that of the unit 
cell previously considered (Figs. 7 and 8). Note that the depletion layer 
exists even at the end of the frame time. Furthermore, the final dis- 
charged condition shows surface inversion under part of the oxide sur- 
face. If the inversion layer acts to interconnect adjacent P* islands, 
then loss of resolution will occur.* Resolution loss does not occur simply 
by the existence of an inversion layer but rather when sufficient current 
flow occurs between the P* island and the inversion layer to connect 
adjacent P* islands. To investigate the formation of the inversion layer, 
it is necessary to examine the Si-SiO, interface potential distribution. 
Figure 12 shows the potential profile (for Q../¢ = 2 X 10°’/cm’ and 
R, = 1.5 X 10% Q/D) at three time intervals in the discharge cycle. 
It can be seen that all the light-generated minority carriers which reach 
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Fig. 10—Calculated discharge characteristics for high sea-sheet resistance. 
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Fig. 11—Calculated discharged conditions for above total surface depletion 
operation. 
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Fig. 12—Si-SiO, interface potential at the end of three equal time intervals during 
discharge. 
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the depletion region are collected by the P* island until that time, late 
in the discharge cycle, when the silicon surface potential remote from 
the P* island drops to below the P* island potential. This condition only 
occurs for high sea-sheet resistance which causes a large radial voltage 
gradient on the sea surface. 

Figure 13 illustrates the $i-SiO, interface potential distribution 
relative to the P* island potential at the end of the discharge cycle for 
several values of light-generated current. This figure shows that for 
low current levels a potential barrier for holes exists between the P* 
island and inversion region. The magnitude of this barrier decreases as 
the junction current is increased. We have assumed that loss of resolu- 
tion occurs when this potential barrier drops below 2kT. For this 
particular example, the diodes effectively interconnect at a light- 
generated current of 2.1 pA, and the strongly inverted layer need only 
extend to within 2.6 um of the P” island. 


IV. TARGET SIGNAL CAPABILITIES 


In this section calculated results are presented which illustrate the 
general effect of diode parameters and operating bias on target signal 
capabilities. 
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Fig. 13—Si-SiO, interface potential relative to P+ island potential at completion 
of the discharge cycle for several levels of light-generated discharge current. 
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4.1 Effect of Signal-Limiting Mechanisms 


Signal current characteristics of target arrays fabricated on 10 Qem 
substrates with Q,,/q equal to 1, 2, and 3 X 10"’/cm’ are shown in Fig. 14. 
This figure graphically illustrates the interdependence of the Si0.-Si 
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Fig. 14—Effect of Qss/9 on single cell signal current characteristics, 
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interface charge density and sea sheet resistance on signal current 
characteristics. 

Oxide-silicon interface fixed charge density affects the array signal 
capacity for all values of sea resistance. For example, consider a target 
fabricated with a low sea sheet resistance of 10 Q/L. If this array has 
an interface fixed charge density of 3 X 10°*/em’, it will provide a 
6.4 pA signal current per diode. A similar target having a fixed density 
of 1 X 10*’/cm’ is capable of only 4.5 pA per diode. Using an estimated 
minimum sea resistance consistent with resolution,* of Rs = 10° 2/0, 
the maximum usable signal current per diode is 5 pA for Q,,/q = 
3 X 10”/em’; and 3.6 pA for Q,,/¢ = 1 X 10"’/cm’. 

For high sea-sheet resistances, above 10° 9/0, the effect of fixed 
interface charge density on signal is even more pronounced since the 
inversion mechanism becomes operative. Consider the case of 
Rs = 10% Q/O. From Fig. 14 we see that for Q,./¢ = 3 X 10"/cem’, 
the available signal is 3.4 pA/diode. If, however, the interface fixed 
charge is reduced to 1 X 10"'/cm’, the inversion mechanism becomes 
operative and the maximum usable signal current is reduced to 1.8 pA. 


4.2 Specific Examples of Geometric Effects 


Certain combinations of P* island geometry, interface fixed charge, 
and sea resistance can result in a limitation in the dynamic signal 
capability of the target due to diode interconnection caused by silicon 
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Fig. 15—Discharged conditions for 4 um window diode, 
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surface inversion. Consider the three geometries shown in Figs. 15, 16 
and 17 which illustrate the effect of two values of sea resistance on 
maximum signal current. The resistive sca potential immediately after 
the electron beam scan is assumed to be zero volts. The final surface 
potential and depletion region geometry are shown for the maximum 
discharged condition. 

In this example, the maximum P” island surface potential is limited 
to 10 volts for the 4 um window geometry and R, = 10” Q/Q; this 
corresponds to a signal current of 0.8 pA. Signal current for this same 
geometry with R, = 10*Q/O is 1.6 pA. The difference in signal currents 
results from surface inversion at the higher value of sea resistance. This 
current-limiting effect is less pronounced for the larger diode diameters 
as illustrated in Figs. 16 and 17. The effect of P* island diameter on 
signal current characteristics for Q,./¢ = 1, 2 and 3 X 10"/cm’ is 
shown in Figs. 18 through 20. These figures graphically illustrate the 
interdependence of the Si0,-Si interface charge density and sea sheet 
resistance on signal current characteristics. 

The junction depth for the diode configurations of Figs. 18, 19, and 
20 is 2.5 um. Junction depth has a pronounced effect on signal current 
capabilities for values of sea sheet resistance greater than 10** Q/O as 
shown in Fig. 21. In this example, a 1.5 wm deep junction will experience 
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Fig. 16—Discharged conditions for 6 um window diode. 
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Fig. 17—Discharged conditions for 8 ym window diode. 










a 43 percent signal limitation due to surface inversion compared with a 
2.5 wm deep junction of the same diameter. 

The effect of target bias on usable signal current has also been inves- 
tigated. Increased target bias reduces usable signal for targets with 
high sea resistance and low interface fixed-charge density. Signal 
capabilities of targets operated at 12- and 16-volt bias can be compared 
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Fig. 18—Single diode signal capability for Qss/¢ = 1 X 10" em~*. 
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Fig. 19—Single diode signal capability for Q../¢ = 2 X 10" em7?. 
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Fig. 20—Single diode signal capability for Q;;/¢ = 3 X 10% cm™?. 
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Fig. 21—Effect of P+ island depth on single diode signal characteristic. 
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in Fig. 22. For R, greater than 10* 2/0, reduction of usable signal for 
the higher bias case is due to inversion effects caused by larger sea 
surface-potential gradients. 


V. EFFECT OF ELECTRON BEAM ACCEPTANCE 


In order to estimate the effects of electron beam acceptance on signal 
characteristics, the following experimental-analytic approach was 
undertaken: 


(z) A mathematical model of the charge-discharge mechanism of the 
target in the scanning mode was established. 

(iz) For several target bias conditions, maximum signal current 
(maximum light level consistent with resolution) and the residual signal 
characteristic were measured. The residual signal lag was measured by 
chopping the target illumination as described in Section 5.2. 

(7) Fabrication parameters for this target (Q,,/q, P* island geom- 
etry, sea resistance and substrate resistivity) were measured. 

(iv) The junction-only capacitance-voltage characteristic was calcu- 
lated from measured target parameters. In this instance, sea resistance 
was sufficiently high so that the junction-only capacitance was the only 
signal-generation mechanism. 

(v) Using the capacitance voltage characteristic of the junction, the 
saturation signal and lag characteristics of the tube, and the mathe- 
matical representation of the target charge-discharge mechanism; a 
beam-landing function was generated by a trial and error procedure 
which satisfies these measured characteristics. In this calculation the 
contribution to lag of interface trapping states was neglected.* 

(vt) Using the beam landing characteristic thus generated and the 
mathematical model, lag and beam limited signal were calculated for 
various diode P* island geometries. 


5.1 Szgnal Limitations 


The resulting signal current versus P* island diameter for a junction 
depth of 2.5 um is given in Fig. 23. Comparison of these results with 
those for perfect beam landing shows that the beam limitation causes 
a significant reduction in target signal capability. For example, a diode 
having 8 ym P* island diameter and a 10'* 9/10 sea sheet resistance has 
a maximum signal current capability of 2.8 pA. With beam limitation, 
signal current is reduced to 1.0 pA. It should also be noted that the beam 


*G. F. Amelio has shown that under some conditions trapping states at the 
oxide-silicon interface can contribute to target lag. 
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Fig. 22—Effect of bias on single diode signal capability. 


current limitation will accentuate surface inversion for the higher values 
of sea resistance. As one might expect, larger diameter oxide windows 
improve beam collection efficiency and thus minimize signal limiting. 
The effect of beam acceptance on the surface inversion phenomenon 
is illustrated in Fig. 24. This figure shows the locus of operation voltages 
as a function of light level for an 8-um P” island diameter and a sea 
resistance sufficiently high to decouple the oxide capacitance. As light- 
generated discharge current level increases the maximum target surface 
potential approaches the target bias. For a sea resistivity of 10° ®/O 
the maximum allowable P* island potential can rise to the 16 volt bias 
potential without diode interconnection due to silicon surface inversion. 
If beam limitation is taken into consideration, the calculated potential 
swing is 12 to 16 volts resulting in a maximum signal level of 1.2 pA per 
diode. If the sea sheet resistance is increased to 10°’ Q/O, surface inver- 
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Fig. 23—Effect of electron beam acceptance on single diode signal capability. 
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Fig. 24—Calculated Pt island potential extremes as a function of discharge current. 


sion limits the P* island potential to 11.0 volts. The maximum signal 
current for this case is therefore only 0.9 pA. 


5.2 Capacitive Lag—Transient Illumination 


Thus far only time invarient illumination of the target has been 
considered. Non-ideal beam landing results in capacitive lag for transient 
illumination conditions. The definition of lag used here is illustrated in 
Trig. 25. Light is left on for a sufficient time interval for the target to 
assume steady state operation; that is, AQcharze = AQuischarge - Lhe light 
is then turned off and the instantaneous signal current at subsequent 
scanning intervals determined. Lag is then defined as the ratio of the 
signal current at 1/15 second after the light is turned off to the steady- 
state illuminated signal. 

Lag is a function of the resistive sea impedance, the diode capacitance, 
and the electron beam acceptance. The electron beam acceptance is a 
function of the resistive sea surface potential. Since both the instan- 
taneous diode capacitance and the sea surface potential are determined 
by the level of light-generated cell discharge current, lag is also a function 
of signal level as illustrated in Fig. 26. For the case under consideration, 
two effects are evident: 
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Fig. 25—Definition of lag. 


(z) At high signal levels lag is enhanced by large target cell capacity. 
(iz) At low light levels, where cell capacity is reduced, poor beam 
acceptance increases lag. 


The result for this example is a minimum lag condition at less than 
maximum signal current. Since this effect is a strong function of the 
electron beam acceptance characteristic, the width of the lag minimum 
can vary considerably from tube to tube. 


VI. ALTERNATE TARGET STRUCTURE 


The occurrence of inversion effects can be minimized, but not elim- 
inated, by fabricating targets with greater than 10 um diameter P* 
islands. 

Inversion effects may also be controlled by means of a partial conduc- 
tive overlay structure. 
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Fig. 26—Calculated lag characteristic. 
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6.1 Description of Overlay Structure 


This structure is essentially that of the standard resistive sea silicon 
target array except conductive buttons have been formed in the oxide 
windows and extend over the adjacent oxide. 


6.2 Analysis 


The analysis of this structure is performed assuming that the sea 
region between buttons is at ground potential (a worst case condition) 
with a uniform potential V7 assigned to the P* island and conductive 
overlay. 

Signal capabilities of this structure were also determined by calcula- 
tion of the capacitance voltage characteristic of the button. Again the 
sea region was assumed to be at zero potential and thus electrically 
isolated from the junction button. 


6.3 Conductive Overlay Characteristics 


Depletion region geometries for a range of conductive overlays diam- 
eters are shown in Fig. 27. For the assumed 12 volt bias three distinct 
silicon surface conditions are represented by these examples: 


(z) Separation of P* island from the peripheral surface inversion 
region by a neutral N region at the interface. This corresponds to a 
11.2 um diameter button. 

(it) Continuous depletion layer through the diode unit cell. For 
a 9.5 wm diameter button the inversion layer formed at the periphery 
is isolated from the P* island by an appreciable potential barrier. 

(az) Continuous depletion layer through the diode unit cell and 
electrical interconnection of the P* island and the surface inversion 
regions. This condition exists for the 8.0 um diameter button. 


For the 9.5 um button, which represents the minimum diameter 
necessary to prevent diode interconnection for the example under 
consideration, the maximum signal current is 1.9 pA/diode. This is 
slightly less than the current obtainable with a resistive sea structure 
with 10°* 2/0 sea sheet resistance. 

The major advantage of a properly fabricated conductive overlay 
structure is the elimination of inversion effects (which include signal 
limiting and video ‘‘blooming”’ phenomenon) inherent in the resistive 
sea type targets operated with total surface depletion. While elimination 
of inversion effects can be accomplished by controlling sea resistance 
(within the range of 1 to 5 X 10” Q/D for Q,./q & 10°’/em’) in con- 
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Fig. 27—Effect of conductive overlay diameter on diode discharged condition. 


junction with large diameter P* islands (>10 um), the button technique 
offers several fabrication advantages: 


(¢) Allowable range of sea resistance can be extended to 10° to 
10° 0/O. 

(iz) For this structure, the critical parameters are easily measured 
physical dimensions, that is, diffused island and button diameters. 
Furthermore, the button diameter is not particularly critical provided 
it is larger than some minimum (approximately 1.5 um greater than the 
P* island diameter). The maximum diameter is of course restricted to 
less than the diode center to center spacing, permitting substantial 
latitude on button size. However, two important fabricational restric- 
tions must be observed. To assure signal uniformity the button diameter 
must be uniform over the diode array. Non-uniformities will be evident 
under light saturated signal conditions where the oxide capacitance 
contributes a significant portion of the signal. The second requirement 


SILICON TARGET 947 


is that the P* island diameter be controlled to assure that the overlay 
will extend the required distance past the P* island edge to avoid 
inversion effects. 


VII. DARK CURRENT 


Surface area and volume components of the unit cell depletion region 
can be calculated as a function of target bias (see Fig. 28). Assuming that 
target dark current is the linear summation of the bulk and surface 
components, this geometric data, with appropriate scaling factors can 
be used to calculate the dark current characteristic, that is, assuming the 
surface component of dark current is given’ by 


1, = nA, (4) 


where 


So = surface recombination velocity in em/second, 


A, = depleted silicon surface area of unit cell, 
n; = intrinsic carrier concentration, 

N = number of diodes, 

q = electronic charge, 


and the bulk component is 


Tout = 
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Fig. 28—Calculated depletion region geometric factors. 
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where 


V = depleted unit cell volume, 
7, = effective bulk generation lifetime. 


For example, Fig. 29 illustrates the application of the above using 
measured values of S, and 7, and compares this result to the measured 
dark current characteristic. 

Figure 30 shows a family of dark current characteristics for a target 
with 5 X 10° diodes normalized to bulk and surface generation rates 
for a specific geometry. A 3-volt offset is used to account for beam 
limitations. 

A plot of dark current versus diode diameter for 10 Qcm material, 
Q../¢ = 3 X 10"/cm’, and Vr = 12 V is presented in Figure 31. It is 
clear, when operating above flatband, that a large P* island diameter 
improves the dark current by reducing the depleted surface area. 


VIII. SUMMARY 


The material presented thus far represents the results of a mathe- 
matical modeling of the silicon diode array. To be of practical use in 
target design the model must accurately represent target behavior. 
Since the complicated nature of the problem at hand precludes the 
normal testing of mathematical models by reduction to simplified cases, 
an experimental verification of the model’s accuracy is the only recourse. 
In this section we present some comparisons of the calculated and 
experimentally measured target behavior. We also draw some general 
conclusions concerning target design. 


8.1 Comparison of Experimental and Calculated Results 


Determination of the target depletion region geometry is fundamental 
to the analysis of target performance. The most convenient experi- 
mental verification of the accuracy of these calculated depletion region 
geometries is to compare the calculated depletion region capacitance for 
the case of a uniform surface potential with the measured capacitance 
of an array covered with a metallic overlay or dot. Figure 32 illustrates 
such a comparison. The solid curve represents the calculated C-V 
characteristic for a target with a SiO.-Si interface fixed charge density 
of 2 X 10**/cm’ and a resistivity of 5 Qem. The measured values were 
scaled from a gold dot covering 560 diodes. Notice the excellent agree- 
ment between the calculated curve and experimental measurements. 

A statistical comparison of theory with measured signal current 
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Fig. 29—Comparison of measured and calculated dark current characteristic for 
5 X 105 diode array. 
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Fig. 31—Calculated relationship between dark current and P* island diameter. 


versus diode diameter is given in Fig. 23. The range in measured maxi- 
mum signal current reflects the normal variations in beam acceptance 
and sea resistance. 

A comparison of calculated and measured dark current characteristics 
is illustrated in Fig. 29 for an array with measured parameters. It can 
be seen from these results that the bulk and surface components of dark 
current calculated from the depletion region geometric parameters and 
measured generation rates provide an adequate model for dark current. 


8.2 Some Generalizations on Target Array Geometry 


In general, several observations of the effect of target diode geometry 
on signal current can be made: 


(i) For high sea sheet resistance (2 10'* 9/1) larger diameter 
diodes have greater signal capabilities. 

First, for the case where signal is not limited by inversion this result is 
due to the simple geometric effect of having a larger capacitance asso- 
ciated with the junction. 

Second, for the case where signal is limited due to surface inversion, 
larger diameter P* islands have the effect of delaying the onset of 
inversion (higher P* island potentials are possible). 
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Fig. 32—Comparison of measured and calculated capacitance—voltage 
characteristic. 


Third, increased beam acceptance for large oxide window diameter 
reduces both lag and signal limitations. 

(iz) For the case where the major portion of diode leakage current is 
due to surface generation, increased P* island diameter reduces dark 
current due to the geometric effect of minimizing available current 
generating surface. 

(277) Silicon interface inversion effects can also be controlled by use 
of conductive overlay whose diameter exceeds the P* island diameter 
by 1.5 um. 
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Image Storage and Display Devices Using 


Fine-Grain, Ferroelectric Ceramics 
By A. H. MEITZLER, J. R. MALDONADO, and D. B. FRASER 


(Manuscript received January 22, 1970) 


Thin plates of ferroelectric ceramic in combination with transparent 
conductive and photoconductive films have been used to form device struc- 
tures that can store a real image as a spatial variation in birefringence. 
This image can be viewed directly by suitably polarized transmitted light or 
projected onto a viewing screen. The stored image is erasable by the applica- 
tion of appropriate combinations of light and electric fields. 


I. INTRODUCTION 


Recent publications have called attention to the usefulness of fine- 
grained, lead zirconate-lead titanate ferroelectric ceramics in several 
kinds of electro-optic devices.’~* This paper reports initial experimental 
results from ferroelectric picture devices (ferpics) based directly on the 
electro-optic properties of these new materials. In addition, this paper 
discusses how the basic device may be used to advantage in several 
types of display systems. 


II. BASIC PRINCIPLES OF OPERATION 


A thin, transparent plate of lead zirconate-lead titanate ceramic, 
as it is initially formed, is optically isotropic. By the process of poling, 
it can be given a condition of uniaxial birefringence dependent upon the 
remanent polarization of the material. The poled material has its optic 
axis parallel to the polarization direction. Figure 1(a) shows how, by 
means of electrodes applied to its edges, a plate is poled so that the 
remanent polarization lies unidirectionally in the plane of the plate. 
In this condition (L-state) the plate exhibits uniform birefringence 
for polarized light incident normally. An image is stored by switching 
at least a portion of the domain polarization vectors in the areas where 
the plate is illuminated to a direction perpendicular to the plane of the 
plate (T-state), thus reducing the birefringence of these regions. The 
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Fig. 1—Construction and operation of an elementary ferpic: (a) poling the ceramic 
plate, (b) storing an image, (c) observing the recorded image. 


required perpendicular switching field can be obtained by means of the 
arrangement shown in Fig. 1(b). The ceramic plate is coated on both 
sides with a photoconductive film and sandwiched between transparent 
electrodes. In front of the ferroelectric picture device (ferpic) is placed 
a high-contrast transparency illuminated by a beam of collimated 
incident light. When voltage is applied to the transparent electrodes, 
the high impedance of the dark photoconductive regions prevents the 
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field inside the ceramic from reaching a value producing a significant 
amount of switching. In the zllumznated regions, however, the impedance 
of the photoconductor is reduced and the field in the ferroelectric in- 
creases to the point that appreciable domain switching is produced 
causing the ceramic underlying the illuminated areas to be switched to 
the T-state. Thus a stored image is obtained as a spatial modulation 
of the birefringence of the ceramic. The image can be made visible 
by inserting the ferpic between a polarizer and analyzer as shown in 
Fig. 1(c). The image can be erased either by poling the sample in the 
plane (L-state) or by thermally depoling the material. 


III. EXPERIMENTAL RESULTS FROM AN ELEMENTARY FORM OF FERPIC 


The details of an experimental device structure that operates in this 
manner are shown in Fig. 2. The device uses a 50 um thick plate of 
ceramic* having 65 percent lead zirconate and 35 percent lead titanate 
with two atom percent of lanthanum (designated 65/35-2 La). The 
grain size of this material is approximately one micron. Originally, the 
plate is poled to have a remanent polarization in the plane of the plate 
(L-state) by means of an applied field of approximately 20 kV/cm. The 
sample is poled before the photoconductive film and the transparent 
electrodes are applied. The specific poling conditions used for a given 
plate are adjusted to give a state of remanent polarization causing a 
half-wavelength of phase retardation when polarized light is transmitted 
at normal incidence through the plate. 

After the plate is initially in a condition such that all regions of the 
plate in the area used for storing the picture are in the L-state, the 
photoconductive film and transparent conductive films are applied. 
In the version of a ferpic outlined in Fig. 2, the photoconductive film is 
PVK? applied simultaneously to both sides of the plate by a dip-coating 
technique. Transparent conductive electrodes are next applied. In our 
experimental devices, half-transparent films of Cr-Au are vapor de- 
posited on the two surfaces; in practical devices, more transparent 
electrodes of tin oxide or indium oxide would be preferred. Fine wire 
leads are attached to these electrodes and used to connect the device 
to the voltage source used to supply a switching field in the thickness 
direction. 

Figure 2, in addition to showing the structural features of the 


* The ceramic used in our experiments was produced by Clevite Corporation, 
Cleveland, Ohio 44108. 

+ The abbreviation PVK stands for polyvinyl carbazole. The material used was 
obtained from Polyscience, Incorporated, P. O. Box 4, Rydal, Pennsylvania 19046. 
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Fig. 2—Construction details of an elementary ferpic and experimental arrange- 
ment used to store an image. 


device, shows the arrangement used to write the picture information 
into the ceramic plate. A high contrast transparency is placed immedi- 
ately in front of the ferpic and illuminated with collimated light. The 
voltage supply is pulsed on. In the regions where the light passes 
through the transparency, the photoconductive film becomes conductive 
and the field in the ceramic becomes large enough to produce switching 
in the form of 90° polarization rotation. (This mode of operation is 
described by Land and Thacher in connection with light gate structures 
using various configurations of metal electrodes.’ In ferpic devices, the 
90° polarization-rotation mode of operation is obtained by the use of 
photoconductive films and transparent conductive films working in 
combination with metal electrodes.) Localization of the switching field 
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requires that the dielectric constant of the ceramic, Koy, , be much 
larger than the dielectric constant of the photoconductor, Kpy, . For 
the materials used in our experimental devices (Kerr/Kpvx) & 400. 
For a 50 um thick ceramic plate, the writing conditions were the fol- 
lowing: (2) a white light flux of 2 mW/cm’, (72) a 200 V supply, and 
(272) a voltage pulse duration of approximately 1 minute. 

As already described, the stored image can be made visible by in- 
serting the ferpic between a polarizer and analyzer, as indicated in 
Fig. 1(c), and illuminating it with light from a collimated, monochro- 
matic light source. (The degree of collimation and monochromaticity 
involved are not critical. Most of our experimental work is done using 
white light from ordinary incandescent sources and, in fact, the two 
photographs included with this report were made with this sort of light.) 
If the phase retardation produced by regions in the L-state is \/2, 
efficient use of light is obtained when polarizer and analyzer are set 
parallel to each other and at an angle of 45° to the electric polarization 
vector in the ceramic. The regions in the L-state appear opaque; the 
regions switched to the T-state present no birefringence to the incident 
light and appear as bright areas to the viewer. 

The ferpic just described can be used as a kind of photographic plate 
capable of two-level (black and monochrome) image storage with a 
respectable degree of resolution. Figure 8 shows photographs made 


A B CDE 





(a) (b) 


LINES/mMm 
A-25.1 B-31.6 C-39.8 D-50.4 E-63.1 


Tig. 3—(a) The original resolution test chart as seen with a microscope using 
low power magnification. 

(b) The resolution test chart image stored in a ferpic and rendered ob- 

servable by means of a polarizer and analyzer used with the microscope. 
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using a low power microscope to view a pattern from a resolution test 
chart stored in a 50 um thick ceramic plate. The number of lines/mm 
in the individual columns marked A, B, C, D, and E is indicated in 
the figure. 

This photograph demonstrates that resolutions better than 30 lines/ 
mm are obtainable under the described conditions of operation. A 
number of experiments have been carried out to establish the principal 
factor limiting the resolution. Fringing fields within the 50 um thick 
plate appear to be the principal factor, since both the optical techniques 
and the photoconductor used have been demonstrated to have at least 
an order of magnitude finer resolution capabilities. The present ob- 
served resolution is already equivalent to 2 to 3 cycles of variation over 
a distance equal to the thickness of the plate. 

The device described in Tig. 2 has limited usefulness since, like a 
photographic plate, it can be used only once. The transparent electrodes 
on the surface prevent the polarization vector under these electrodes 
from being switched back to the L-state when a voltage is applied to 
the poling electrodes. We will next consider a form of ferpic that offers 
the capability of being electrically changeable. 


IV. AN ELECTRICALLY CHANGEABLE FERPIC USING AN INTERDIGITAL 
ELECTRODE ARRAY 


An interdigital electrode array deposited on one side of a ceramic 
plate provides in principle a means of switching the polarization vectors 
back into the plane of the plate after they have once been switched 
normal to it. An exploded view of the layer structure proposed for use 
in a changeable ferpic is shown in Fig. 4. In addition to enabling the 
plate to be switched into the L-state, use of the interdigital array has 
the great practical advantage of reducing the voltage required to pole 
in the longitudinal direction. As already indicated, 20 kV is required 
to pole over a 1 cm distance. If the line elements have the same spacing 
as the plate thickness, the same voltage supply (~200 V for a 50 um 
thick plate) can be used to establish both longitudinal and transverse 
switching fields. The obvious disadvantage to this approach is that the 
stored image will now be broken into a number of discrete lines. The 
extent to which the presence of the lines is evident and objectionable 
will depend on the details of use. 

The ferpic structure shown in Fig. 4 constitutes an electrically change- 
able image storage and display device that functions in three steps: 
(t) RESET, (27) WRITE and (277) vinw. The first two steps are illustrated 
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Fig. 4—Exploded view of the layered structure used to form an electrically 
changeable ferpic. 


in Fig. 5(a) and (b); the third step involves essentially the same ele- 
ments shown earlier in Fig. 1(c). In the REsET step the electrodes of 
the interdigital array are connected to the voltage supply and the 
resulting field switches the remanent polarization vectors predominantly 
into the plane of the plate. In this condition of polarization (L-state), 
every region of the plate has maximum birefringence for linearly polar- 
ized light incident normally. In the write step, the elements of the 
array are connected in parallel to one terminal of the supply and the 
other terminal is connected to the transparent conductive electrode. 
Light is directed at the area to be switched causing the photoconductive 
layers to conduct and the electric field in the ceramic under the illumi- 
nated area to exceed the coercive field. 

If the image to be stored in the ferpic is broken into elements, the 
spacing between lines of the interdigital array should be less than or 
equal to the size of an element. Writing the image an element at a time, 
as in a television picture, can be accomplished by modulating either the 
addressing light beam or the power supply. It is equally conceivable 
that the picture could be formed all at one time by projecting some 
desired image on the plate and switching the illuminated elements. 

The basic ideas of the interdigital array device were first demonstrated 
in a device structure using PVK films. The details of this interdigital- 
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Fig. 5—Details of the Reser and writs steps of operation for a ferpic utilizing 
90° polarization rotation in the ceramic plate. 


array ferpic are shown in [ig. 6, and an example of the image storage 
obtained is shown in Fig. 7. The upper part of Fig. 7 shows the original, 
simple, high-contrast image. The lower part shows the image observed 
through a low-power polarizing microscope. The image was stored in a 
50 wm thick ceramic plate with a square working area 0.8 cm on a side. 

The experimental structure of Fig. 6 differs from the original structure 
described in Fig. 4 by the inclusion of an additional transparent con- 
ductive electrode on the array side of the ferpic. This additional film is 
needed because the PVK films do not have a high enough conductivity 
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Fig. 6—Construction details of a ferpic using an interdigital array to switch the 
polarization into the plane of the plate. 


when illuminated to establish an equipotential region between the 
elements of the electrode array. While the addition of the transparent 
conductive electrode to the array side of the ferpic solves the problem 
of the low conductivity in the PVK, this modification has the disad- 
vantage that it hinders erasure. The PVK film between the transparent 
electrode and the array is now subject to breakdown field strengths 
when the RESET voltage is applied. In addition, experiments with PVK 
films on devices using the 90° polarization rotation have indicated that 
the PVK film constrains the motion of domains, probably through a 
mechanism of trapping polarization charges at the ceramic-PVK 
interface. 

Our most recent experimental efforts have resulted in two significant 
developments, one related to the performance of photoconductive films 
used in interdigital-array ferpics and the other related to a new version 
of a ferpic. 

In connection with photoconductive films, we have been successful 
in realizing CdS films with a high light-to-dark conductivity ratio and 
a high conductivity when iluminated. These films have been sputter 
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Fig. 7—A simple, high-contrastimage stored”as a variation injbirefringence in 
ferpic using an interdigital electrode alray: upper, the original image; lower, the 
image viewed in the ferpic i 


by means of 4 polarizing microscope. (The horizontal 
lines apparent in the photograph are formed by the electrodes 


of the interdigital 
array and these are spaced 250 em apart on the actual device. ) 


FERROELECTRIC PICTURE DEVICES 963 


deposited on device structures consisting of ceramic plates with inter- 
digital arrays vapor deposited, in registration, on both surfaces. Images 
with nearly the quality of that shown in Fig. 7 have been successfully 
stored in the CdS film devices, and, what is more important, these 
devices have demonstrated a significantly greater degree of changeabil- 
ity than interdigital devices using PVK films. The experiments with 
these devices demonstrate that the ceramic and the CdS films can work 
together successfully with interdigital arrays to provide the intended 
mode of operation. 

In connection with the new form of ferpic, a refined structure has been 
developed which eliminates the necessity of having a separate set of 
electrodes or interdigital arrays in order to switch the ceramic into the 
L-state. In this new device structure, a thin ceramic plate and photo- 
conductive film are sandwiched between transparent electrodes, and 
the stack is bonded to a relatively thick, transparent substrate. The 
ceramic plate is put in tension along one direction by slightly flexing the 
substrate. The direction of the tension axis in the ceramic becomes a 
preferred direction along which the polarization vectors in individual 
domains tend to align. (Because of the use of a permanent strain to 
establish this preferred direction, the device is called a “strain-biased 
ferpic.”’) In the strained condition, the ceramic can be switched between 
two states, corresponding to an L- and T-state, by the application of 
fields in the thickness direction. As in the earlier versions of the device, 
only the illuminated regions can be switched to the T-state. In order 
to reset the whole device to the L-state, the whole active area is flooded 
with illumination while the reset voltage is applied. An important fea- 
ture of the new device is that its structure permits localized switching 
to the L-state as well as to the T-state. A paper describing in detail 
the structural and performance features of the strain-biased ferpic is 
presently in preparation and will be published at a later time. 


V. OPTICAL DISPLAY SYSTEMS WITH LASER BEAM ADDRESSING 


Figure 8 shows the essentials of a projection display system using a 
ferpic in combination with a laser beam addressing module and a 
source of viewing light. A dichroic mirror that transmits at the projec- 
tion wavelength and reflects at the write wavelength is used to enable 
the laser subsystem to be positioned off the main axis. The arrangement 
shown in the schematic drawing is intended to be an electronically 
changeable, projection display. In this display system, there are three 
stages to a complete cycle of operation: (2) RESET, (77) WRITE and (i722) 
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Fig. 8—A projection display system using a ferpic in combination with a laser 
beam addressing module and a source of viewing light. 


view. During the rEsET stage, the viewing light and laser beam are off 
and the whole ferpic is switched to the L-state. During the writ 
stage, the viewing light remains off while the picture is formed an ele- 
ment at a time by moving the laser beam successively through all the 
address points. The laser beam is steered by an X-Y deflection system 
which could be an ultrasonic light deflector system of the sort recently 
developed for use in an optical memory system* or a mechanical scanning 
system. The choice exists of controlling the switching of a picture ele- 
ment by modulating either the intensity of the laser beam or the in- 
tensity of the voltage pulses supplied to the electrodes of the ferpic. 

Once the picture has been formed, the viewing source is flashed on. 
An important advantage of this display system is that, once written, 
the picture can be held without any further expenditure of electrical 
power. This feature would make a ferpic display attractive for many 
applications now met using storage cathode-ray tubes. The fact that 
the basic light controlling element has intrinsic memory has important 
implications for practical applications. As a consequence of this intrinsic 
memory the display is able to hold a given picture for an indefinitely 
long time. Furthermore, the picture is not volatile in the sense that it 
is retained in the event of a power failure. Depending upon the details 
of construction and associated circuitry, it may be possible to break up 
the stored picture into individual lines or elements and use a “periodic 
update’ mode of operation of interest in connection with bandwidth- 
reducing techniques of picture transmission. 
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In the display system proposed here, the image is formed one picture 
element at a time by a scanned laser beam, but is viewed or projected 
as a whole using incoherent light in a conventional projection system. 
This separation of the scanning and viewing functions allows a relatively 
low-power laser, in any convenient wavelength range, to be used for the 
scan, while a powerful, efficient incoherent source provides the actual 
viewed light. 

We are not able at the present stage of the device development to 
state whether or not the device will have the lifetime and switching 
speed capabilities required to make it useful in real-time display sys- 
tems. (For this sort of application, ferpics would be in competition with 
devices like the trrus tube’ and the n1porHor tube.°) However, there 
are applications that can make use of a slowly-scanned, high-resolution 
image storage device that can hold an image for an arbitrarily long time 
and either continuously display it by an optical projection system or 
project it onto some form of hard-copy print-out. This sort of applica- 
tion would not require the high cycling rates or extreme lifetimes of a 
device used in real-time display systems. 

With regard to obtaining maximum lifetime in a ferpic, there is an 
important point that deserves explicit statement: It is not necessary to 
rotate the polarization vector through a full 90° in order to obtain a 
useful effect. Operation under conditions producing full 90° polariza- 
tion-rotation provides the means of obtaining the maximum bire- 
fringence change with a minimum material thickness.* On the other 
hand, such a large change in polarization direction produces strains 
that could lead to premature failure of the ceramic. In devices using 
ceramics with sufficiently low optical loss,’ there exists the alternative 
of switching the average polarization through an angle much less than 
90°, and then using increased thickness to obtain the desired half- 
wavelength retardation change. 


VI. REFINING THE BASIC DEVICE TO OBTAIN GRAY-SCALE AND COLOR 
CAPABILITIES 


A ferpic, as described in this paper, is basically a two-level (black 
and monochrome) image storage and display device. However, modifica- 
tion of the device structure and system configuration to include color 


* According to Land and Thacher,! a An of —0.022 is obtained in 65/35 — 2 La 
ceramic at maximum remanent polarization. This An requires only 16 yum of 
material to produce a half-wavelength of retardation for 6328 A light. 

+ A new hot-pressed, lead zirconate-lead titanate ceramic composition has been 
developed at Sandia Laboratories with lower optical loss and improved electrical 
switching characteristics.” 
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and a gray-scale appears to be straightforward. For example, the system 
of Fig. 8 can be modified for color by using a sequential-field, color-scan 
technique. Instead of one source illuminating the ferpic, there would 
be three light sources (red, blue, green) with appropriate means to 
strobe these in sequence. The ferpic would operate as already described 
except that there would now be three different RESET operations per 
frame, with a voltage applied to produce a retardation of a half-wave- 
length for the color flashed during the view stage. In practice the opera- 
tions RESET-WRITE-VIEW would be repeated in sequence for each color. 

Modification of the basic system of Fig. 8 to obtain a gray scale re- 
quires that intermediate states of birefringence—and hence polarization 
—hbe reproducibly attainable, so that the ceramic plate functions as an 
intensity modulator and not just as a simple shutter. Experiments’ with 
elementary, light-gate devices using the fine-grain ceramic have shown 
that it is possible to obtain reproducible partial switching with suitable 
circuitry. The development of a ferpic with gray-scale capability will 
depend strongly on the switching properties of the ceramic and the 
photocurrent response characteristics of the photoconductive films 
with which the ceramic is used. It is still too early to state uniquivocally 
that a gray-scale device can be obtained with all other desired char- 
acteristics, but the possibility is certainly there. 


VII. CONCLUSION 


The recent development of fine-grain, electro-optic ceramic materials 
has provided the means of realizing ferroelectric devices capable of 
storing high-contrast images under the control of electrical voltages. 
The image is stored as a variation of birefringence in a thin, ceramic 
plate and can be viewed directly by suitably polarized transmitted 
light or projected onto a viewing screen. Experimental devices have 
demonstrated a resolution capability of approximately 50 lines/mm 
in 50 wm thick ceramic material and have been able to hold the image 
with no apparent change for times of the order of several months. While 
our early experimental devices have had only limited electric change- 
ability, the basic material system is certainly capable of providing this 
essential feature. Images stored in these devices have been projected 
and viewed by means of simple, desk-top, commercially available, 35 
mm projection displays (modified by the inclusion of a polarizer and 
analyzer). Display systems of the sort considered appear to be well- 
suited to applications such as high-resolution, slow-scan, and document 
display. 
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Some Mathematical Properties of a Scheme 
for Reducing the Bandwidth of Motion 


Pictures by Hadamard Smearing 


By E. R. BERLEKAMP 
(Manuscript recieved September 29, 1969) 


M. R. Schroeder recently proposed a scheme for compression of motion 
picture data by taking the difference of two successive frames and then 
smearing.’ The smearing is accomplished by a Hadamard matrix. 

If the Hadamard matrix is of a certain particularly well-understood 
type, then we show that if the input differential picture consists of a small 
odd number of large pulses of identical magnitudes (but arbitrary signs), 
then the output will consist of three components: 


(2) Large pulses of equal magnitude and the correct signs, matching 
each of the input pulses. 

(17) One additional “stray” large pulse, of magnitude equal to the 
others, but located at a point where the input was zero. 

(12t) Scattered pulses of amplitude low relative to the pulses of types 
t and 11, but so numerous that they consume (x — 2)/m of the total energy 
of the output differential picture. 


We giwe an explicit formula for the amplitude of each of these pulses. 


The problem of determining the distributions of all possible outputs 
of the proposed system for other classes of inputs is shown to be equivalent 
to the unsolved problem of finding the weight enumerators for the cosets 
of the first order Reed-Muller codes. 


I. INTRODUCTION 


The fact that successive frames of a motion picture are often very 
nearly alike has led to the consideration of schemes which transmit, 
for each point of the picture, the difference between the amplitude 
of the present frame and the amplitude of the previous frame. Since 
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this differential picture* is frequently zero at many points, there is 
reason to hope that the bandwidth required for transmission of the 
differential picture could be greatly reduced by appropriate coding. 

One such coding scheme which has been considered by W. K. Pratt, 
J. Kane, and H. C. Andrews,’ and refined by M. R. Schroeder’ is the 
following: let the differential picture be represented by a real n-dimen- 
sional vector, v. (For example, if the picture is represented by a 100 X 
100 grid, then » = 10000.) Let 3c be an n X n Hadamard matrix, 
which is a self-orthogonal real matrix all of whose entries are +1, 
and let the smeared differential picture (or transformed differential 
picture), x, be defined by x = 3¢v/(n)*. Let Q be the power-preserving 
clipping operator, defined by 


ox = (le ee (x) 
On 


where sgn (x) is the n-dimensional vector whose 7th component is 
+1 or —1, depending on the sign of the zth component of x. Since we 
wish the quantizer to have only two outputs, we cannot take sgn (0) =0. 
Unless stated otherwise, we assume that sgn (0) is undefined. Schroeder 
has asserted that the vector y = Qx = Qscv/(n)? provides an appropriate 
“encoding” of the differential picture v. To ‘‘decode” one computes 
z = 1/(n)*sc'y. The question to be studied in this paper is the quality 
of z as an approximation to v. 

Some of the heuristic arguments favoring this proposed scheme are 
the following: Since successive frames are frequently very similar, the 
differential picture will have near-zero amplitude at most points. In a 
typical case when the camera is focused on a moving subject and a 
fixed background, the differential picture will be identically zero at 
all background points. If the subject and the background each has 
uniform color (the simplest plausible case), then the differential picture 
will be nonzero only at those points on the boundary of the subject. 
Furthermore, all of the nonzero amplitudes in the differential picture 
will have equal magnitudes, although their signs will depend on whether 
they are on the leading or trailing edge of the moving subject. The 

* To be precise, the ‘‘differential picture’’ should consist of the difference between 
what the present frame actually is and what the decoder thought the last frame was. 
Since all of the errors in the system are assumed to arise from quantization, rather 
than from any sort of unpredictable noise on the communications channel, the 
encoder may include a replica of the decoder, thereby enabling it to compute what 
the decoder thought the last frame was. Each transmitted differential picture then 
includes an attempt to correct the cumulative effects of all previous errors. In this 
paper, we study only the quantization noise introduced in the encoding and decoding 


of a single differential picture, ignoring the complicated dynamic questions which 
arise when one studies the behavior of the system during several successive frames. 
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conventional manner of encoding the differential picture is to quantize 
the amplitude at each gridpoint. This scheme will introduce no quantiza- 
tion error at all on the background points, which have zero amplitude, 
but a relatively high number of quantization levels may be required 
to keep the quantization errors along the outline of the subject down to 
a tolerable level. The Hadamard transform of the differential picture, 
on the other hand, will have its energy spread out relatively uniformly 
among the grid points. A coarse quantization of the smeared differential 
picture will introduce quantization errors throughout the differential 
picture in a relatively uniform manner. When the quantized smeared 
differential picture is unsmeared, the quantization errors, being some- 
what independent, should tend to cancel out. It is thus hoped that a 
coarser quantization of the smeared differential picture might yield 
a decoded differential picture of the same fidelity as a substantially 
finer quantization of the original, unsmeared differential picture. 

A somewhat more theoretical discussion of the effects of quantization 
in the Hadamard transform domain is given in Section VI of Pratt, 
Kane, and Andrews.” The main result is that the Hadamard trans- 
formation preserves energy. Hence, if the amplitudes at the various 
points in the transformed differential picture are independent zero mean 
gaussian random variables, then the energy of the noise introduced by a 
two-level quantizer would be (# — 2)/z of the total energy in the output 
differential picture, both before and after unsmearing. From this view- 
point, the major attraction of smearing is that it distributes the quantiza- 
tion noise uniformly throughout the picture. If our simple model of a 
differential picture (which has nonzero amplitude only along the 
outline of the subject) is coarsely quantized, then all of the quantiza- 
tion noise appears on the outline, where it will tend to blur the subject. 
However, if this differential picture is smeared, coarsely quantized, 
and unsmeared, then its quantization noise should be evenly distributed 
throughout the subject and the background. 

We shall now study the relationship between the original differential 
picture, v, and the decoded differential picture, z. It is clear that the 
energy in the vector z is always identical to the energy in the vector v. 
Hence, for purposes of analysis, it is easiest to compute z according 
to the formula 


z = A&C’ sgn Kv 


where for each frame A is a non-negative scalar chosen to make the 
energy in z equal to the energy in v. In this paper, we often omit the 
actual calculation of A. 
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In the case where v has only one nonzero component, then sgn 3Cv = 
5ev and z = A3c'stv. Since 3Cc’3C = nI (where I is the n X n identity 
matrix) it follows that z = v. In other words, the system transmits 
a single pulse without error. 

On the other hand, when v has only two nonzero components, then 
the component with the larger amplitude dominates the component 
with the smaller amplitude. In this case z again has a single nonzero 
component, even though v had two nonzero components. However, 
the choice v = [1, 1—e, 0, 0, 0, --- , 0] results in an ambiguity. If 
we instead write v = [1, 1, «, 0, 0, 0, --- , 0], then we may actually 
find that, in the limit as e > 0, z > [1, 1, 0,0, --- , 0]. Ifv = [1, 1, 0, 0, 
-++ , QO], then z is undefined because it depends on sgn 0, which is either 
plus or minus. In fact, z is undefined whenever v has an even number 
of nonzero components, all of equal magnitude; but this difficulty might 
be removable by adding an appropriate background noise function 
into v, or by choosing sgn(0)’s independently at random. To avoid 
the necessity of such considerations, we devote our primary attention 
in this paper to the case in which v has an odd number of nonzero com- 
ponents, all of unit magnitude (but arbitrary sign). In this case, every 
component of 3Cv is an odd integer. Since every component of 3tv 
must therefore have magnitude at least 1, the sgn function is defined 
and the analysis remains valid in the presence of a small background 
noise in any or all components of v. 


Il, HADAMARD MATRICES 


The requirement that any three rows of a Hadamard matrix be pair- 
wise orthogonal leads to the immediate conclusion that if n > 2, then 
ann X n Hadamard matrix can exist only if n is a multiple of 4. The 
question of whether or not there actually do exist n X n Hadamard 
matrices for all = 0 mod 4 is now one of the most intriguing unsolved 
problems in combinatorial theory. Many ingenious constructions have 
been proposed, and several of them succeed in obtaining Hadamard 
matrices for an infinite number of (scattered) values of n. For example, 
if n is a multiple of 4 and n — 1 is a prime-power, then a well-known 
construction based on quadratic residues in the finite field GF(n — 1) 
yields an n X n Hadamard matrix. Many other constructions for 
Hadamard matrices are given in Chapter 14 of Hall,® and more recent 
constructions have been presented by Spence,* Goethals and Seidel,’ 
and Wallis.°’’ The smallest value of n = 0 mod 4 for which no n X n 
Hadamard matrix has yet been constructed is n = 188. 
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For many values of n, there exist Hadamard matrices with additional 
structure. For example, some Hadamard matrices have the property 
that the first row and first column consist entirely of -++-1’s, and the 
remaining (n — 1) X (n — 1) submatrix has the property that each 
of its rows is a cyclic shift of the previous row. Such matrices are called 
cyclic Hadamard matrices. They are known to exist whenever n — 1 
is prime, or when n — 1 is the product of twin primes, or when 7n is a 
power of 2. A computer search by Thoene & Golomb® and some cal- 
culations by Baumert’ have shown that no cyclic Hadamard matrices 
of other orders less than 1000 exist, with the possible exceptions of 
n = 400, 496, 628, 652, 784, 976. 

Irom the viewpoint of an algebraic coding theorist, a shortened 
Hadamard matrix (obtained from a standard Hadamard matrix by 
multiplying each row by an appropriate sign to make the first column 
all +1’s, and then deleting the first column) is equivalent to an equi- 
distant binary code. The n codewords are taken as the rows of the 
shortened Hadamard matrix, with each +1 replaced by 0 and each 
—1 replaced by 1. Since the dot product of any pair of rows in the 
shortened Hadamard matrix is —1, the distance between any pair of 
words in the binary code is (x + 1)/2. Further discussion of such codes 
is given in Section 13.5 of Berlekamp.”° 

The best-understood class of equidistant binary codes is the maximal- 
length shift-register codes, which are also called shortened first-order 
Reed-Muller codes. In addition to being cyclic and equidistant, they 
are linear over the binary field, which means that the component by 
component binary sum of any pair of codewords is another codeword. 
Stated in terms of the original Hadamard matrices, this property means 
that the componentwise product of any pair of rows of the Hadamard 
matrix is another row of the same Hadamard matrix. Although Hada- 
mard matrices with this property are relatively rare, they exist for 
every 7 which is a power of 2. Because of their correspondence to the 
Reed-Muller codes, these matrices are comparatively well-understood, 
and we shall henceforth confine our discussion to Hadamard matrices 
of this type. Such matrices may be taken as cyclic. 


Ili. THE INDUCED COORDINATIZATION 


A 2* x 2° Hadamard matrix corresponding to a Reed-Muller code 
induces a (non-unique) coordinatization on the 2" components of each 
row, associating each component with a k-dimensional vector over 
GF (2). A set of 2’ coordinates is said to form an affine subspace iff the 
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corresponding 2’ k-dimensional vectors form an affine j-dimensional 
subspace over GF(2). Similarly, a set of m coordinates are said to be 
linearly (or affine) independent iff the corresponding k-dimensional 
vectors are linearly (or affine) independent. 

If a set of m binary k-tuples a; , a2 , +: @, are affine independent, 
then they span an (m — 1)-dimensional affine subspace, each element 
of which has a unique representation of the form 


for some 7. A set of binary vectors are affine independent iff no subset 
containing an even number of vectors sums to zero. An affine basis for 
the set of all 2° binary k-dimensional vectors may be selected in various 
ways. The “standard” basis consists of the all-zero vector and each 
of the k “unit” vectors. In general, any k + 1 affine independent vectors 
may be chosen as a basis. 

If k is very large, then the probability that a randomly chosen set 
of k + 1 k-dimensional binary vectors will be affine independent is 

*., (1 — 27), or about 29 percent. The probability that k randomly 
chosen vectors will be affine independent is about 58 percent; for 
k — 1, itis 76 percent. If m < k, then almost every set of m different 
k-dimensional binary vectors is affine independent. 

The first row of the 2° x 2* Hadamard matrix may be taken as all 
+1’s. The 2°~' +1’s in each of the other rows occur in the components 
corresponding to some (k — 1)-dimensional subspace of the k-dimen- 
sional binary vectors, and the —1’s occur in the components correspond- 
ing to the complementary (k — 1)-dimensional affine subspace. 

The coordinatization induced by the Hadamard matrix is invariant 
under all changes of affine basis. When translated into coding ter- 
minology, this is equivalent to Theorem 15.35 of Berlekamp.’° 


IV. MAIN RESULT AND DISCUSSION 


4.1 Theorem 
Let v be a 2"-dimensional vector whose only nonzero components 
are 2m + 1 units occurring at components corresponding to k-dimen- 
sional binary vectors a , a , @2, °** Gem Which are affine independent. 
Let the vector z be defined by the equation 
Zz = 5’ sen Sv. 


Then the value of zs, the component of z corresponding to the k- 
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dimensional binary vector 8, is given by 


I if 8 is not in the affine subspace spanned by 


Qo > &y 3&2, °** » Hom 
&B 


S2*?"(2m — 27)!(27)! . - ae 
mG sa ee 


and the sign is given by 


27+1 


S = (-D)' TT... 


4.2 Remarks 


We first notice that the answers do not significantly depend on k, 
but on 2m + 1, the number of units in the input vector v. 

Since z is defined by an equation of the form z = &’y, where the 
energy of y is n, the energy of zisn? = 4°. Fori = 0, 1, --- 2m we have 


jn = [are(OO) 


For large m, | 2a, |’ is closely approximated by 2”°/xm. Thus only 
2/m of the total energy in z is located in those components in which v 
has units; the remaining (rt — 2)/z of the total energy is distributed 
throughout the affine subspace. For example, if m = 5, k = 10, we 
have the following output: 

















Corresponding aa) Number of such 

! Input Value | Output Value Value Coordinates 
1 252 16 X 3969 C= AL 
0 28 16x 49 | (3) + Co) = 220 
0 12 16 X 9 | (5) + Cy) = 792 
0 252 16 X 3969 Gy 4 
Totals 16 X 2"° 1024 

For all m, we notice that if 8 = )/77) a; , then 
| 2p | = | ea; bs 


In other words, if the input to the coding-transmission-decoding system 
consists of 2m + 1 pulses of equal magnitudes (and arbitrary signs) 
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located at positions which are affine independent then the output 
may be written as the sum of the following three terms: 


(t) 2m + 1 pulses of equal magnitude (and correct signs) matching 
the input. 

(77) One stray pulse of the same magnitude as the 2m + 1 correct 
ones. 

(wiz) Other errors scattered throughout the affine subspace, having 
maximum amplitude [1/(2m — 1)] times as large as the correct pulses. 


Since each of the errors of type 727 has a relative amplitude approaching 
zero for sufficiently large m, one might consider the proposed system 
“successful” in some sufficiently broad sense of that term even though 
these errors consume (t — 2)/m of the energy in the output signal. 
The error of type 27 poses a different problem, even though it consumes 
a negligible fraction of the energy. Further research may be required 
to decide whether these difficulties might be removed by replacing the 
operator Q by another quantizer with more levels. Further study will 
also be required to determine how these quantization errors propagate 
in successive frames in a dynamic system. (See footnote on page 970.) 


V. RELATIONSHIP TO THE WEIGHT ENUMERATION PROBLEM FOR RM COSETS 


In the previous sections we calculated the vector z = 4c’Qs3cv for 
certain specific choices of v. These vectors v were chosen to be “‘typical”’ 
in some intuitive sense, and yet sufficiently simple in form to enable 
us to carry through the calculation in closed form, even when the 
dimensions of the 3C matrix (n X n) were large. 

Instead of assuming some ad hoc form for the vector v, we might 
instead ask, what is the range of the operator 3¢'Q3C? Except for the 
scalar factor, this is equivalent to determining the range of the operator 
5c’ sen. For, if there exists a vector x such that z = JC’ sgn x, then 
n sgn X = sgn 5Cz and z = (3C’ sen 3Cz)/n. Hence, every vector in the 
range of 3C’ sgn is proportional to a vector in the range of 5C‘Q3C, and 
every vector in the range of 3C‘Q3C is a stationary point of this operator. 
Stated another way, (3¢'Q3c)” = 3c’Qse. 

In more practical terms, an investigation of the vectors in the range 
of 5C’ sgn is actually an investigation of the ensemble of possible dif- 
ferential pictures which the proposed system might produce as output. 
This set is identical to the set of differential Picnics which the system 
will encode and decode with zero error. 
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If # is ann X n Hadamard matrix, then there are 2” vectors in the 
range of 3¢’ sen. For reasonable values of 7, 2” is so large that a complete 
listing of all of these vectors is not feasible. Fortunately, however, 
these 2” vectors fall into a relatively small number of classes, each class 
consisting of those vectors which have the same distribution of mag- 
nitudes of component amplitudes. The problem of determining the 
possible distributions of magnitudes of the component amplitudes of 
a vector in the range of JC’ sgn turns out to be identical to the problem 
of determining the weight enumerators for the cosets of the Reed- 
Muller code. This equivalence is seen as follows: If y is a real vector 
whose components have unit magnitude, then the number of components 
of 3¢’y with magnitude | A | is the number of rows of 3C whose dot 
product with y is +A. On the other hand, if we convert 1 to 0 and 
—1 to 1, changing 3¢ to S and y to R, then the weight of the binary 
vector sum of R and each codeword of the extended max-length feed- 
back shift register code gives the distance between the received word 
R and the corresponding codeword, and the enumeration of all of these 
weights for a particular R is the weight enumerator for the coset con- 
taining R. If c is a binary codeword for which w(c + R) = d and 
if u is the real vector of +1’s corresponding to c, then u and y disagree 
in d components and agree in » — d components. Therefore, 


wey =n — 2d. 


Since the first order Reed-Muller code contains both the codewords 
in the extended maximum length feedback shift register code and their 
complements, there is a one to one correspondence between RM cosets 
with weight enumerator d, , d,, :-- , d, and real vectors in the range 
of 3C' sgn having magnitudes of component amplitudes distributed as 
follows: d; components with amplitudes +(n — 27) for 7 = 0, 1, 2, 

- ,n/2 — 1, and d,,/2/2 components with amplitude zero. 

The coset weight enumerators for first order RM codes of lengths 
up to 16 have been determined by R. Dick and N. J. A. Sloane.” 
Their results, and the corresponding distributions of magnitudes of 
amplitude components of the output of the differential picture encoding- 
decoding system are shown in Table I. Those rows which are predicted 
by our main theorem have been checked, and the relevant values of 
m have been listed. 

The coset weight enumerator for the first order RM code of length 
32 was determined by Berlekamp and Welch,” and the results are shown 
in Table IT. 

The coset weight enumerators for first order RM codes of length 


TasBLte I—Coset WricHT ENUMERATORS FOR First OrppR RM Copss or Lrenetus 4, 8 AND 16 




















n= 4 
Amplitude +4 2 0 
Number 
of vectors 2 Value of m if 
in range Number 0&4 1&3 (entries predicted by 
of 3C* Sgn of cosets Weight halved) main theorem 
8 1 1 3 0 
8 1 4 1 
n=8 
Amplitude +8 +6 +4 +2 0 
0 1 2 3 
& & & & (entries 
Weight 8 7 6 5 halved) 
16.1 1 1] 7 0 
16.8 8 1 7 — 
16.7 7 4 4 1 
eo Ss es est et 
n= 16 


Amplitude +16 +14 +12 +10 +8 +6 44 +2 0 


0 1 2 3 4 5 6 7 8 
& & & & & & & & (entries 
Weight 16 15 14 13 12 11 10 9 halved) 

32.1 1 1 15 0 
32.16 16 1 15 
32.120 120 1 7 8 
32.560 560 1 3 12 
32.840 840 2 8 6 
32.35 35 4 12 1 
32.448 448 6 10 2 
02.28 28 16 














826 


OLGT LSNONV-ATINL “IvVNUNOL IVOINHOAL WALSAS TIA WHL 
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>64 have not yet been determined. This problem definitely merits 
further research. 


VI. ACKNOWLEDGMENTS 


I am indebted to Mr. J. R. Pierce for suggesting this problem and 
to Mr. N. J. A. Sloane for suggestions which simplified the proof of the 
identity in Appendix A. 


APPENDIX A 


A Sketched Proof of Main Result 


One of the implications of our main theorem is that if @ lies outside 
of the affine subspace spanned by the relevant a’s, then zg = 0. A 
generalization of this result is the following: 


Theorem: If the only nonzero components of v all ie in a (k — 1) 
dimensional affine subspace and 8 lies outside of this subspace, then zg = 0. 


Proof: Using the elementary properties of RM codes and affine 
subspaces, the original 2° X 2° Hadamard matrix may be partitioned as 


_[s S| 
ig E ~g 


where G is a 2""’ X 2*"* Hadamard matrix. In terms of this partition, 
the last 2""’ components of v are zero and JCv is of the form [w, w]' 
for some appropriate 2*~’-dimensional vector w. We then obtain 


5c’ sen [w, w]’ = 2[z, O]' 
where 
z’ = G' son w'. 


By repeated application of this theorem, we deduce that z, = 0 
unless 8 is in the affine subspace spanned by the coordinates of the 
nonzero components of v. If the coordinates of the nonzero components 
of v span a d-dimensional affine subspace, then an appropriate change 
of coordinates allows us to work with a 2° X 2° Hadamard submatrix, 
which is also Hadamard. The original output z vector merely gains a 
factor of two for each omitted dimension. 

Applying these arguments to the main theorem of the text allows us 
to confine our attention to the case when k = 2m. 

Since the RM code is invariant under the full affine group, there is 
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TaBLE II—Cosrer Wericut ENUMERATORS FOR [rst ORDER 


Boolean 
function 
for coset* 


Téven Cosets 


2345 

2345&12 

2345423 
2345&23&45 
2345&12&34 
2345&123 
2345&123&12 
2345&123&24 
23454123414 
2345&123&45 
2345&123&12&34 
2345&123&14&35 
2345&123&12&45 
2345&1238&24&35 
2345&123&145 
2345&123&145&45 
2345&123&145&24&45 
2345&123&145&35&24 
123 

123&45 

123&14 

123&14&25 

123&145 
123&145&23 
123&145&24 
123&145&23&24&35 


12 
12&34 


RM Copes or LENGTH 32 





Number of 
such cosets 








496 X 120 
496 X 35 
496 X 28 
496 X 840 
17360 X 2 
17360 X 24 
17360 X 18 
173860 * 192 
17360 XK 32 
17360 X 72 
17360 X 576 
17360 X 96 
17360 K 12 
13888 X 320 
18888 X 32 
13888 X 480 
13888 X 192 
155 X 8 
155 X 512 
155 X 168 
155 X 336 
868 X 32 
868 X 320 
868 X 480 
868 X 192 

1 X 155 

1 X 868 
1x 1 


496 X 1 


rFocoooocoocooooococncoeoceoooooooo|| NO 


eS) 


ooococcoco“cooocooooocoocooooo ooo oOF | LON 


iS) 


oqoocoocoococooorcooo°coocoooooorooooo |! OF 


bo 


OoocoocorcocoooeoroqocoococoorCcoOorFrOo | Om 


Weights 


2. 


8 
4 
0 
2 
0 
0 
0 
0 
0 
2 
1 
0 
0 
0 
1 
0 
1 
0 
0 
0 
0 
0 
2 
0 
0 
1 
0 
0 
4 
0 
0 


Value 
of m 
if pre- 
dicted 





015 
014 
0 12 
0 10 
8 14 
3 16 
4 14 
416 
414 
4 12 
4 12 
8 14 
8 16 
12 16 
6 15 
6 15 
6 13 
10 15 
7 0 


—_ 
o 
i] 
oe} 


ee Lael 

SCAPOONDO® 
ioe) 

oqooomoooo 


SOCORRO RFRCOROCRFWOFOCON EF PNORFONOWNO 
—_ 


10 12 14 16 
22 20 18 (halved) 








* These functions are written in an abbreviated notation. For example, the 
second line, 2345&12 indicates that this equivalence class of cosets includes one 
coset whose members are the 64 Boolean functions of the form X2X3X4X5 + 
XX + AX, + BX, + CX3 > DX, + EX; ++ F, where A, B, C, D, E, and ¥ 
are arbitrary binary elements. 


no loss of generality in assuming that a, = 0, that a,, a, °° 


are unit vectors, and that 
ve, = +1 for 1=0,1,--- , 2m. 
Any other case can be reduced to this case by an appropriate affine 


transformation of coordinates. 
We now determine the distribution of the vector 


x = Sv. 


» Gam 


REDUCING 


TaBLeE [I—Cont’d 
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138 5 7 91113 15 
Odd Cosetst 31 29 27 25 23 2119 17 
— 82x 1 100000 0 81 
12 32 X 155 000413 0 0 28 
12&34 32 X 868 0 0 0 0 0 610 16 
123 4960 xX 1 010000 7 24 
123&12 4960 X 7 004100 383 4 24 
123.&14 4960 X 84 00011 2 6 22 
123&45 4960 X 64 00003 1 7 O21 
123&14&25 4960 X 336 00 0 0 0 610 16 
123&12&45 4960 X 448 00 001 818 15 
123&12&34. 4960 X 84 00002 4 4 22 
123&145 27776 X 10 00011 +012 18 
123&145&12 27776 X =66 001 0 03110 20 
123. &145&23 27776 X 80 00010 83 9 19 
123&145&45&23 27776 X 16 0001 041415 15 
123&145&24 27776 X 180 00 00 2 210 18 
123&145&24&23 27776 X 240 00001 5 7 #19 
123&145&35&24 27776 X 240 00001 8138 15 
123.&145&35&24.&23 27776 X 192 00 0 0 0 610 16 
123&145&45&35&24&23 | 27776 X 60 0000 0 416 12 


t All functions representing odd cosets also contain the term 12345, which is not 
shown in this table. 


Since x is the sum of the all-ones vector (corresponding to the column 
of 3C associated with a) = 0) and 2m columns of 3¢ which correspond to 
linearly independent codewords in the RM code, it is readily seen that 
there are (°”) components of x which have the value 2m + 1 — 22. 
It follows that y = sgn x has 0", 7") +1’s and >.2",,(") —I’s. 

For convenience, we may partition the components of y into 2m + 1 
subsets, each of which corresponds to the components of x with the 
same value. We call the set which consists of (°") components where x 
had value +1 the “significant”? set of components. The sets of in- 
significant components may be matched up in pairs; the set where x 
had value 2m + 1 — 2t¢ being matched with the set where x had value 
2m + 1 + 2t. Each of these matched sets contains (*") components. 

We now let 6 be a typical 2m-dimensional binary vector, which is the 
sum of 2j7+1 a’s. By an appropriate permutation of basis vectors, 
we may assume that 


27 27 
B=a+ Dia: = Dia. 
i=1 i=1 


The equation z = 3c’y now allows us to compute zs as the dot product 
of a particular row of 3c’ and y. The equation 6 = }>.?/, a; and the 
correspondence to RM codes allows us to express the particular row 
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of 3C’ under consideration as the componentwise product of the first 
2j rows of 5¢’, ignoring the zeroth row, which is all plus one. Symbolically, 
if we let r, , f2, °** , Yem denote the first 2m rows of 3C’, then 


Ze = (1, © re & +++ & es) sgn (¥ +.) 


t=1 
where ‘“&’’ denotes the componentwise product. Since 27 is even, we 
may also write 
Z 2m 
2s = [(—n) & (—r) & +--+ & (—1,)] sgn r) ‘ 

The dot product is the sum over all 2°” components of the component- 
wise product of (r; ® r. @ -+: ®@ re;) and y = sen (>.2", r,). Since 
there is cancellation of the summands coming from matched sets of 
components into which we partitioned y, we need only consider the 
?™) “significant”? components. On each of these, y takes value +1, and 
the problem reduces to the following: Given a 2m X (") matrix, whose 
columns represent all ways of distributing m plus ones and m minus 
ones among 2m rows, compute the sum of the entries in the component- 
wise product of the first 27 rows of this matrix. The solution is obtained 
by noting that if there are 7 minus ones in the first 27 rows, then the 
componentwise product is (—1)* and this happens in (')?"=2') columns. 


Therefore, 
i297 — 2 
ze= 2 (-1) ?)( is '). 


m—t 
Having already explained the other factors in the more general version 
of the theorem stated in the text, the theorem is reduced to the identity, 


(29) (2m — 27) 2 (—1)’@m — 24)! 29)! 
re) a jim'(m—p! 


Multiplying through by (m!)’/(2m — 2j)1(2j)! reduces this to the 
equivalent identity, 


een(™(,” ) 2 en"). 


whose proof is given by Riordan (p. 14, line 7 from bottom)."* Q.E.D. 


APPENDIX B 


An Example 


Suppose that the differential picture consists of a 4 X 4 grid, the 
points of which are lettered as follows: 
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The signs of the units in the 16 X 16 cyclic Hadamard matrix with 
which the differential picture is smeared may be taken as: 





Ie) a [es aL 





AG ee tale Pele 


Fea 
+] | 
+) + 





|. 
ae 
| + 
+) 1 i+ 
ie 


fest ee ce 
Bes Det ea eed 
PSR 
—|+]-]+ 











cao 





The induced coordinatization may be read from the 2nd, 3rd, 4th and 
5th rows. It is: 
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Representation as Sum of 
Odd Number of Affine 


Grid Point | Coordinatization Basis Vectors 
A 0000 H+I1I+1L 
B O0O0o1 E+H+I1+4L+P 
C 0010 H+L+P 
D 0100 E+I1I+41L 
E 1001 E 
F Oo0l1l E+H+L 
G 0110 E+L+P 
H 1101 H 
I 1010 I 
J 0101 I+L+P 
K 1011 E+I+P 
L 0111 L 
M l11iil H+I1I+P 
N 1110 E+H+I 
O 1100 E+H+P 
P 1000 P 


The coordinates of B through P may also be taken as the successive 
contents of this shift register: 


Now suppose the differential picture is this: 
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The coordinates of the nonzero inputs are as follows: 


+ 


+ 


+ 


-+ 


++ + + 


+ 


+++ | 


E 
H 
I 

L 
P 


aa 
+ 


1001 
1101 
1010 
0111 


1000 


0001=sum 


These are affine independent, so our main theorem applies. The stray 
output pulse will be located at point B, since this is the point whose 
coordinates are the vector sum of the coordinates of the other inputs. 

We now calculate the output vector step by step, without using the 
theorem. The input picture is: 


v = [0, 0, 0, 0, +1, 0,0, —1, +1, 0,0, —1, 0, 0,0, —1]. 


The ‘‘smeared picture” is x = 3v, which is given by the sum of the 
following rows: 


+ — 
+ + 


a 
+ 


++ +1 
“ | 


aie 
| 


| 
++ 4 + 
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+ + 


x = —1,-1,43,-1,+1,-1,-1,-3,+1,+3,-3,-1,-3,+5,+1,41 


The quantized smeared picture is 


y = Qx 


a7 eS ee te ee eee ee earl 
The decoded differential picture is given by 


Z= HY 


[=2, =6, hay 2, +6, =o, ae 
+2, —2, —6]. 


—6, +6, 


= 25.2, 


=0; as 
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When scaled down by a factor of six and placed on the grid, the output is 








In order to match the total power of the input, the output must be 


scaled down slightly more. 
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Binary Codes Which Are Ideals in the Group 
Algebra of an Abelian Group 


By Mrs. F. J. MacWILLIAMS 
(Manuscript received January 13, 1970) 


A cyclic code is an ideal in the group algebra of a special kind of Abelian 
group, namely a cyclic group. Many properties of cyclic codes are special 
cases of properties of ideals in an Abelian group algebra. 

A character of an Abelian group G of order v is, for our purposes, a 
homomorphism of G into the group of vth roots of unity over GF(2). If G 
is cyclic with generator x, the character 1s entirely determined by what it 
does to x; this effect 1s kept, and the characters are discarded. If G is not 
cyclic it is necessary to rehabilitate the characters. Without them the notation 
is impossible; with them one can prove a number of theorems which reduce 
in the special case to well-known properties of cyclic codes. Moreover the 
writer thinks that the general proof is often easier and more suggestive than 
the proof for the special case. To support this point of view we produce a 
new theorem, which of course also applies to cyclic codes. 


I. INTRODUCTION 


A cyclic code is an ideal in the group algebra of a special kind of 
Abelian group, namely a cyclic group. Many properties of cyclic codes 
are special cases of properties of ideals in an Abelian group algebra. 

A character of an Abelian group G of order v is, for our purposes, a 
homomorphism of G into the group of vth roots of unity over GF(2). 
If G is cyclic with generator x, the character is entirely determined by 
what it does to x; this effect is kept, and the characters are discarded. 
If G is not cyclic, it is necessary to rehabilitate the characters. Without 
them the notation is impossible; with them one can prove a number of 
theorems which reduce in the special case to well-known properties of 
cyclic codes. Moreover the writer thinks that the general proof is often 
easier and more suggestive than the proof for the special case. To support 
this point of view we produce a new theorem, which of course also applies 
to cyclic codes. 
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The plan of this paper is as follows: Section II contains a summary of 
the properties of ideals in an Abelian group algebra. Section III contains 
a description of the group characters; the reader is assured (and we hope 
reassured) that an effort has been made to point out the analogies with 
the cyclic case. In Section IV the characters are extended to the group 
algebra. This section contains the general cases of several familiar 
theorems, for example, the dimension of the code, a lower bound on its 
minimum distance, the Mattson-Solomon mapping, and the identifica- 
tion of the dual code. In Section V the structure of product codes is 
examined for the general case. Section VI contains the new theorem 
(which needs too much notation to be explained here) and the special 
case of this theorem which applies to cyclic codes. The Appendix 
contains an illustrative example of the smallest possible nontrivial case. 


II. GENERAL PROPERTIES OF ABELIAN GROUP ALGEBRAS 


Let G be a finite Abelian group of odd order v; the group operation 
is written as multiplication. 

Let R = FG be the group algebra of G over the field F = GF(2). 
FR consists of finite sums 

A= > a9, a,eF, 
geG@ 

In FG we have two operations, addition and multiplication, defined 
as follows: 


As B= 2, (a, + b,)g, 


and for f « G, 
fA = » afg = Dy Oy-109. 


geG 
This implies 
AB = >> dS a,b,h. (1) 
heG@ of=h 
We use 1 to denote the unit of G, and 1, 0 to denote the unit and zero 
of FG. 

From the first of these operations we see that FG has the structure 
of a vector space F’ of dimension v over F. 0 is the zero vector and 1 
is the vector (100 --- 0). 

An ideal @ in FG is defined as follows 


@ is a linear subspace of F’, 
Ae@=gAe@ forall geG. 


ABELIAN GROUP CODES 989 
From the general theory of semi-simple group algebras,’ we know 
that FG is a principal ideal ring; that is, every ideal is of the form 
@ = {rA,re FG} forsomeelement Ae FG. 


We denote the ideal with generator A by (A). In fact every ideal has 
an idempotent generator; @ = (N), where N = o,.¢ mg has the 
properties: 


N* =N, 
reQerN =r. (2) 


Since the ground field is GF(2), and G is commutative 


NS Sag. 
geG 
so that 
N= dy 169 


is idempotent if and only if 7, = »,. for all ge G. 
FG is the direct sum of its minimal ideals, 


BG AO) ese (8, ); 


and every ideal in FG is the direct sum of a subset of these minimal 
ideals.’ The idempotents, 6; , of the minimal ideals are called primitive 
idempotents and have the additional properties 


Ds Op, (3) 


6,0; = 0, 1# js (4) 

(9;) \ (93) = 0, a A Je 
Every idempotent in FG is the sum of primitive idempotents. Since 
we are over GF'(2) the sum of idempotents is idempotent, and the set 
of all idempotents is a vector space I; #;, --- , 4, are a set of linearly 


independent basis elements for I, which is thus of dimension ¢. 
We also define a set of ‘‘trivial” idempotents as follows: 


Let y: = le G. Pick ge G, g ¥ 1, and set 
yo = {9 0°; ie vey g’} 


where g’"”* = g (this must happen since G is finite and of odd order). 
Pick f ¢ y, U y2 and define the set ys; = {f, f’, f*, --: , f’ }. In this way G 


+2 
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is partitioned into disjoint classes, which we call cycles 

G = yy, yn Ygr. (5) 
Define Y; « FG by 

= dg, (for example, Y= gtg?+--- +4"). 

The Y, are the trivial idempotents. From equation (2) it is clear that 
every idempotent is the sum of trivial idempotents, and they are 
obviously linearly independent over F. Hence the trivial idempotents 
also form a basis for J over F. We have proved the following Lemma: 


Lemmai1.14: The number of trivial idempotents is the same as the number 
of primitive idempotents, and each set is linearly dependent on the other; 
that ts, there exists an invertible t X t matrix (m,;) over F such that 


6, ¥a 
= (m;;) 
0, is 


From a practical point of view it is desirable to find the @;. The 
algorithm for doing this is as described in Ref. 2, except that the 
group is no longer cyclic. Briefly, we form linear combination of the 
Y, in a systematic way until we find t idempotents which satisfy equa- 
tions (3) and (4). An example is given in Appendix A. 


IiI. GROUP CHARACTERS 


Since we shall make extensive use of the characters of the group and 
the group algebra, we give a brief account of their properties. 

For our purposes, a character of G is a homomorphism y of G into 
the vth roots of unity over GF(2). These vth roots of unity lie in an 
extension field GF(2’) in which the expression 2” — 1 splits into linear 
factors. They form a cyclic subgroup of the coup pneenney group 
of non-zero elements of this field. 

Formally 


VAG) = v9). (6) 


Hence 
(1) = 
(the unit of G on the left and of GF (2°) on the right) and 
vg") = W@)I" 
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If G is a cyclic group of order v, with generator x, a character is a 
map xz — 6, where 8 is a vth root of unity. In this case one usually does 
not distinguish between the character and the value it assigns to 2. 
We define multiplication of characters by 


(ov) (9) = o(9) ¥(9). 


Under this operation, the characters form a group, X. The unit of &, 
called the principal character ¥,, is the map 


g—1 forall geG. 


The group G and the character group & are isomorphic in many ways. 
We construct a particular isomorphism and use it henceforth. 


Theorem 2.1: (Reference 3) The Abelian group G has a unique decom- 
position as the direct product of cyclic groups of prime power order, 


G=GxXGxX::: XG,, G; cyclic of order p;'. 


(The primes p; are not necessarily district.) 
Pick a generator x; for G; , and a fixed primitive p*‘th root of unity, 
a;. Let y,; be the character defined on the generators by 


Wa; (2;) = ai; We; (2;) = 1, 1A i. 


By equation (6) this is sufficient to define y,, on any g e G. We may 
by equation (7) define y%, 


¥3.(9) = [va 


Lemma 2.2: If ¢ ts any character of G, then @ can be represented in the 
form 


Proof: Let g(a;) = 6. Then 
BP = ox)" = oat") = gl) = 1. 
Thus 8 is a power of a; , say 8 = aj’. We then see that 


(IT at) = [tet = Tat. 


4 


Hence 
e= Tv. 


Set a = [| 2% and denote the character » = [| Yi by ¢,. We then 
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have 


Lemma 2.3: The mapping a = ¢, as defined above is an tsomorphism 
between G and X. 


We also use ¥, to mean the character corresponding to a in this 
isomorphism. 
Lemma 2.4: g.(b) = ¢.(a), and g,-:(b) = ¢.(b’). 
Proof: Let 


: b be 
GS a ees oF, Day eee ee 


Then 


IT | [vz .(b)]** =H} I lesa", 
= Ties = 9,(a). 


The second statement is proved in a similar way. 

We shall need the following theorem which is well known, so the 
proof is omitted. The skeptical reader may easily construct an ele- 
mentary proof by using the properties of the roots of unity. 


Theorem 2.6: 


gab) 


() Zva = a eke 


oe 0 otherwise. 
@) Lw@- f v=, 
ae 0 otherwise. 
If G is cyclic, both parts of this theorem reduce to 
o-1 : if B a 1 
De gi ae ! 1 ’ 
om 0 otherwise. 


Let X(g) be a matrix whose columns are labeled by the characters 
y and rows by the group elements g. The entry in row y, column g 
is ¥(g). An example is given in Appendix A. 


Lemma 2.6: (g)%"(g~") = diagonal [vv --- v] = vl. Hence X(g) ts 
invertible. 


Proof: A typical entry on the main diagonal is 


2d, WVgv(g") = >, v(1) =v, by Theorem 2.5 (ii). 
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A typical off-diagonal entry is 


dy vd ¥G') = Be ¥.(a)¥.(0"'), 
= >> y,(ab™’) = 0, by Theorem 2.5 (72) since a ¥ b. 


IV. CHARACTERS OF THE GROUP ALGEBRA 


If G is a cyclic group of order v, with generator z, and A = A(z) 
an element of FG (that is, a polynomial of degree less than v in x) then 
A(8) is the value of the character x — 6 on A. In the general case, 
for A = >. ,.¢ a,g in FG, we extend the character to the elements of 
the group algebra by 

(A) = 2, ao¥(9). 
Using the notation of Theorem 2.1 and those that follow, we could 
write an element of FG as a sum of terms of the form zi'z}? --- 27°, 
0 <j; < p’’. A isa polynomial in the variables x, , --- , x, with restric- 
tions on the degree of each variable. A character is a mapping A(z, 

- 2.) > A(6,, °°: , B.) where 6; is a (p**)th root of unity. As pointed 
out in the introduction, there are certain advantages to using this 
polynomial notation as little as possible. 

If G is cyclic, we know that 


A(2)B(x) |-@ = AB(a) |.-.. 


Analogously for the general case (and with the same proof, using 
equation (1)), 
W(AB) = ¥(A)¥(B). 
If G is cyclic, it is usually the case that A(6,)A(62) # A(G,62). There 


is however a vital exception, namely A(@)” = A(6’). Similarly, in 
the general case 


vo(A) # Y(A)g(A), but 
Lemma 8.1: ¥(A)* = (A) = V(A’). 


Proof: 
[W(A)? = [d a,¥(g)]* = > a,¥(9)’, 
= », a,¥(g°). 


A cyclic code is an ideal in a cyclic group algebra. It is frequently 
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described as the set of polynomials which vanish on a certain prescribed 
set S of vth roots of unity: 


@ = {A(z): A(®) = 0, Be S}. (7) 


Similarly, we can characterize an ideal in the group algebra of an 
Abelian group as the set of elements of FG which vanish at a prescribed 
set of characters: 


Q= {Ac lG: (A) = 0, ve S}. (7') 


From Lemma 3.1 we see that in the general case, as in the special 
case, the maximal set S corresponding to a particular ideal must have 
a special form; in fact it is the union of sets {y, y’, ¥*, «++ }. 

It is well known that the dimension of the cyclic code associated by 
equation (7) with the set § is the number of vth roots of unity not 
contained in S, that is, the number of nonzeros of the code. Similarly 
in the general case. The following two theorems are proved in Reference 
4; we repeat the proofs here for convenience, and also supply an example 
in Appendix A. Let g:, g2, °°: , g, be the elements of G. Associate 
with the element A the v X v matrix (a,,-:,,). The entry in row 7 column 
j is the coefficient of g; in g;A. The ideal @ = (A) is generated as a sub- 
space of R = F” by the rows of the matrix (a,,-:,,). The dimension 
of this ideal is the rank of this matrix. 


Theorem 8.2: The dimension of the ideal (A) is the number of characters 
y such that ¥(A) # 0. 


Proof: The matrix 97(g7*)(a,,-1,,)(g) has the same rank as (a,,-1,,), 
since by Lemma 2.6 X(g) is invertible. A typical entry of the product 
(a,;-+9,)X(g) is of the form 


ny = >, Og s-19, Vo i( Ga) 


gpeG 
Now 
> Ag 5-19, Wo ;(9x) = d Ag, Vo (9iGx) ey ¥o;(9:)* Wo ;(A). 
Thus 


. ny = Wo;(9i) Wo ;(A). 
In the product X(g"*)(n,;) the diagonal terms are of the form 


,;(A) >» ¥(g")¥(g) = vy,,(A). 
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The off-diagonal terms are of the form 
W,;(A) PB v(g"')¥(f) = 0. 
Thus 


97") "(Gy .-19,) L(g) = diagonal [¥,,(A), v,,(A), Loam ¥,,(A)], 
and the rank of the matrix (a,,-:,,) is the number of characters for 
which ¥(A) ¥ 0. 

We call these characters the non-zeros of the ideal (A). 

Let D be the m X v submatrix of X(g)” whose columns are indexed 
by the group elements and rows by the m characters for which 
¥(A) = 0. If a = (a, a2, -** , a,) is a vector of (A), then Da” = 0. 
If D contains no set of t linearly independent columns, the minimum 
weight in (A) is at least ¢ + 1. This is the extension of the BCH bound 
for cyclic codes. It is generally a very weak lower bound. 


Theorem 8.3: (The Mattson-Solomon mapping—see Reference 5.) 
1 
@ IfA= Diag, then a, => 2) vA’). 
(22) Tf vag = 2 vg"), then W,(A) =z Bi, . 


Proof: 
(7) » WAT’) 


»» », a,w(g)¥(f"), 
Ds Qa, »» v(gf’) = Ua; . 


geG 


(ii) vb(A) = D7 aWn(9), 


geG 


= > ps, BoWs(g ')¥a(Q), 
= 26, ps v/(g"')¥n(9), 
= 2,8, d, vo(f')¥.(h), 
= 2B, z v(f"h), 


= Br. 
Corollary 3.4: A is uniquely determined by the set of values y(A). 


We divide the group G as in equation (5) into cycles corresponding 
to the trivial idempotents of FG, and divide the character group X 
into similar classes by the isomorphism of Lemma 2.3. 
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G=eyUy UN, (5) 
y= W,UYU-:---UYW,. (8) 


y: = 1 and WY, contains only the principal] character y,. By Lemma 
3.1 if y(A) ¥ 0 for some We V;, then Y(A) ¥ 0 for all Pe VW; . The 
non-zeros of A are a union of cycles V; . 

The minimal ideals have the smallest possible dimension, so that by 
Theorem 3.2 the non-zeros of a minimal ideal are, if possible, the char- 
acters in a single class VY, . (This is in fact possible; an explicit construc- 
tion is given in Section V.) If 0; is the idempotent of this minimal ideal 
we may define 6; by the property 


W(6,) = i ae 9) 
0 otherwise. 


Theorem 3.5: The dimension of the ideal (@;) is |W; |, the number of 
elements in WV; . 


Since every ideal in FG is the direct sum of minimal ideals, every 
idempotent is of the form C = >>; ¢:6;, 7 = 0 or 1. The dimension 
of C is >> e; |v; |. From equations (2) and (4) we have immediately: 


Theorem 8.6: If Cy, Cz are idempotents with non-zeros ®, and ®, , and 
®, C ®,, then (C;) is a subideal of (C2). 


The dual code of (N) is the set of vectors b,, --- , b, such that 
>o3., ab; = 0 for all vectors a,, --+ , a, in (N). The dimension of 
the dual code is v — dim (N). 


If N is idempotent, the dimension of ((1 + N)) is v — dim (N). 
This follows at once from the fact that 1 = >0'_, 0;. For A = >> ag, 
set A* = >) a,g™. 


Theorem 3.7: The dual code of (N) is (1 + N)*). 


Proof: Let >> b,-.g e ((1-+N)*); then >> b,g 2 ((1+N)). Since N(1+N) 
= 0, for any >) a,ge (N) we have 


(D0 a,9)(2 bog) = 0. 
From the coefficient of 1 in this product 
> a,0,-1 = 0. 


Therefore, ((1 + N)*) is contained in the dual code of (NV), and has 
dimension v — dim (NV). Thus it is the dual code. 
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V. QUASI-CYCLIC AND PRODUCT CODES 


Let H be a proper subgroup of order u of the Abelian group G; and 
let 


G=hHURHU-+-»Uk,H (kh = 1,0 = w) 


be the decomposition of G into cosets of H. In this section we suppose 
the coordinate places in FG to be arranged in the order 


JT Hees BH Mig Mens BS Heelan 84S Te Reyes athe 


Let @ be an ideal of FG, and denote by @; the part of @ which lies in 
the coordinate places kh, , --- , k;h,. @, is an ideal of FH (usually 
several repetitions of an ideal of FH), and since k;@, = @; the codes 
@, are all repetitions of @,. Each vector of @ consists of w vectors of 
@, ; these are not in general the same vector, and some of them may 
be zero. If H is a cyclic group, @ has the structure of a quasi-cyclic 
code. Since G contains cyclic subgroups of order p for every prime p 
which divides v, @ may have this structure in several different ways. 
- We make the additional assumption that G is the direct product 
G = H X K of subgroups H, K. This means that H ()\ K = 1, and each 
element of G can be expressed uniquely as g = kh, ke K, he H. The 
character group & is correspondingly a direct product 


X= Xy XK Xx, 


where Xy , Xx are the images of H, K under the isomorphism of Lemma 
2.3. Every character can be expressed uniquely as 


VY = Gu¢K , gx te XLy , greXr. 


We shall need the following result. 


Lemma 4.1: oyex (hk) = en(h)ex(k). 


Proof: From the isomorphism of Lemma 2.3, 
gu({k) = 1, ox(h) = 1. 


Let A = Doan ah, B = YOux b,& be idempotents in the group 
algebras FH, FK. Let ®,, @x be the non-zeros of A, B respectively. 
®, , Pg correspond to cycles of Xz , Xx which are, of course, also cycles 
of &X. 

The Kronecker product of matrices, M/, N, is denoted by M X N 
(an example is given in Appendix A). 
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Theorem 4.2: (i) C = AB ts an idempotent of FG. 


(i7) The codes (C) is the direct product of codes (A), (B). 
(212) The non-zeros of (C) are guex, Gxt Py, Oxt Ox. 
(iv) The minimum distance of (C) is the product of those of (A), (B). 
Proof: (2) Itis clear that C = ae x 0,k > 1 ,h is idempotent. 
(iz) The first row of the Kronecker product 
(a) X (0) = (Gay-tar) &K (Ors-220) 
consists of the coefficients of C. The second row contains the coefficients 
of h,C, and the (wu + 1)st row the coefficients of k.C. Without further 
notation, we see that the rows of this Kronecker product generate the 
code (C’) as a subspace of F”’. 
(iz) X(G) = Xy(h) K Lx(k). By Theorem 3.2, the non-zeros of C 
are given by 
[eu(h") KX Lx(k-")]"[(a) X (6)][u(H) XK Lx(K)] 
= X7(h"')(a)Xu(H) X E(k’) (b)lx(K). 
The triple matrix products are diagonal matrices with ones in the 
places corresponding to g « ®, (ye x) and zeros elsewhere. Their 


Kronecker product is a diagonal matrix with ones in the places cor- 
responding to CuK, Pu & P;, > YKE Dx . 


(iv) This is a well-known property of direct product codes. 


Given an idempotent C' of FG we would like to know how, if possible, 
to find subgroups H, K such that G = H X K, and C = AB. The 
following theorem is sometimes helpful. 


Theorem 4.8: Let V be the set of non-zeros of C; suppose Y can be expressed 
as the product of two sets of cycles ®,, &, where ®, © Xy, Be e Xx and 
XL = Xy X Xx. (Consequently, G = H X K.) 


Then C = AB, where A, B are idempotents in FH and FK, with 
non-zeros ®, , ®, ; consequently the code (C) is the direct product of codes 
(A) and (B). 


Proof: 
C= De Aunkh = ky a Gy, nh + ke De Arash +--- +k, d a, frh. 


kheG heH 


By Theorem 3.3 (2) 
Qksn = PD W(C)V(k;'h-’) = > 23 gipa(kih’), 


G1ePs GoePs 
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since by hypothesis 
1 if = 
W(C) = y PiP2 


0 otherwise. 


dy g2(kz’) 2d ei(h™) 


G2ePs gies, 


by Lemma 4.1. Set a, = Doyen anW(h7'), where 
a, = ( yp & ®, 


0 otherwise. 


Set A = ae" a,h; then 


W(A) = % = 


0 otherwise 


i Ye ®, 


by Lemma 3.5 (72). Define B similarly for K. Then A, B are idempotents 
in FH, FK, and C = AB. 

If H, K are cyclie groups whose orders are relatively prime, then G 
is also cyclic. The codes (A), (B) are cyclic codes in FH, FK mea peceivelys 
and (C) is a cyclic code in FG. 

This special case has been thoroughly investigated by Burton and 
Weldon’ and Goethals.” 

The extension to direct products of more than two subgroups is 
theoretically obvious, but rather hard to visualize. An example for the 
cyclic case is given in Appendix 2. 


VI. A NEW THEOREM 


Everything in this paper so far is a natural extension of known results 
about cyclic codes. This section is not; the special case of Theorem 5 
for G cyclic is new and interesting (at least the writer thinks so). 

The primitive idempotents @, of FG have been defined by the property 


1 WV; ’ 
v(6,) = Os (10) 
O peY,. 
We recall that the trivial idempotents are defined by the property 
Y; = Ss 459, 4, = , a a4 (11) 
geG 0 g ¢ Yi ‘ 


Since these properties look remarkably symmetrical, one expects to 
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find some symmetry in the matrix (m,;;) (Lemma 1.1) which relates 
6; to Y, . This in fact exists, as follows. 
We recall that 
= > G0: 


9G 


Theorem 6.1: 
t t 
0; = s T.Y - bea = a T.Op P 
k=l k=1 


Proof: Let 


» b9 


yeG 


Dy dD (9)¥G")g by Lemma 3.3 7 


geG per 


(Note that 1/v = 1 in characteristic 2.) 
= 2,( 2, wg ))g by (10). 


From definition (8) of %;, we may suppose that ¥, = {y, Wp, 
- , Wat}. Then the inner sum is 


v/(g-’) or vp(g-’) ss as Vre(g’) 
= v,(f’) + v(f) i mits + vi(f°) by Lemma 2.4, 


= ¥,(Y%). 


bas 
I 


I 


Thus 
= > WAY *)g. (12) 


(This is the explicit construction for 6; .) Now suppose 


bide, _ = 7,9, 7 rT, € GF(2). 
kel 
t 
¥(Y%) = ps reWo(81), 


¥(0.) = i‘ 9°06) fom (9). 
0, g ¢ Y, . 

Hence ¥,,(Y*) = 7,, ¥,(Y*) = r, forall ge Y,. PuveMeaune in equa- 

tion (11), we obtain 
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é,=n+7. Mgt tn POE F 


geYo ge Y¢ 


Let ¥,(Y;) be the common value of ¥(Y;) for ye VY, . Equation (12) 


then becomes 


2 
., 
I 


2s Vi( Y*) Y;, ’ 


t 
SS reY. : 


kel 


(18) 


With a slight change of notation, let 


t 
6; = > Mir Vx . 
k=l 
Let P be a permutation matrix such that P acting on the column 
vector (Y,, Y2,--: , Y,)” produces (Y%¥ , Y¥,--+ , VY)”. 
Theorem 8.2: 
i) =P. 
Proof: By Theorem 5.1, 
t t t 
Y# = » MO. = y, Mik > Ms Vs , 
k=1 k=1 ; 


j=1 


t 


t 
dS ( mass) Y; = 
k=1 


7=1 


Hence 
: 1 Ye = Y; ’ 
>. Mi, = 
_ 0 otherwise. 


We give a brief description of the special case G cyclic of prime order p. 
FG is now the polynomial ring R = Flx]/z? + 1. Let f be the order 
of 2 mod p. If p — 1 = ef, then 


2=¢° 


for some generator g of the integers mod p. The trivial idempotents, 
other than 1, are of the form 


ge ge ag ee sa iad gee. 
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Let o be the automorphism of R induced by x — 2’; define 


Xoeatetatt-.. ta", X¥,=Xiw, t= 1,---,e - 1. 
Then 

X, =a tate tee pat, oa = gl’. 
Since the trivial idempotents were previously called Y,, --: , Y; we 


have changed notation; now 
Yl, V9. & XG 5.08 Vee es 
We rename the primitive idempotents correspondingly, 
6, = J, O62 = yo, ''' > Oe = Ne-- 
The characters of G are defined by 
W(t) = a’, 
where @ is a primitive pth root of unity; thus 7; is defined by 
ee if k= g**, 
0 otherwise. 
This may be rewritten as 
a = , t~=k, (14) 
0 otherwise. 
In particular 
1 a =1, 


0 otherwise. 


J(a’) = 


Hence 

p-l1 

d= x 

7=0 
Write 

e-l 

n= M+ b> MirX 4 ; 
k=0 
m, = X*(J) = f, 


Mix ss X*(a°***) . 
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Since —1 = g°/” we have 
Xt = i : f even, (15) 
Xises2 J odd. 
Now 
e-1 
ng = f+ DS MirXbsr ; 
and 


Min a Xo = xt, a 
by equation (14) and the definition of X; . Hence 


Mix = Mi-1,%41 4 


and 
no = f+ > Mi-1,;X; = -1 - 
Set 
e-1 
= f+ 2 MX 5 
then 
0, = 00° =f+ » MrsiXp « 
Clearly 
J=1+ ae 2 ‘ 


The matrix corresponding to the (m;;) of Theorem 5.2 is of the form 


rad 
f’ M 


where J, f are now vectors of length e and 
Mo ; MM, *'% y  Me-1 
M = Mm, Mo, *** 5 Mo 


Me-1 ’ Mo ’ aorpra: M,.~2 
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Let P be the permutation matrix which turns the column vector 
(1, Xo, °°: , Xe-1)” into (1, X*¥, --- , X*,,)”. By equation (14) 


P=TI for f even, 





P= : | Foi 
07 ov 
where 
0 1 O «2 “& 
Q=f 0 0 1 ID JL] ofsiee @-) x @-. 
] 0 ee inet 0 


From Theorem 5.2 we have 
ltée, J+1M)_p 
f? + Mf” f1+ 
For f even, f71 is ane X e matrix of zeros, hence 
M* =I if f is even; 
for f odd, £71 is an e X e matrix of ones, which we denote by K. 
M=K+Q” if f is odd. 


The matrix MM, which is symmetric and circulant in the wrong direction, 
can be made circulant in the usual way by multiplication by a suitable 
permutation matrix. Skipping the obvious details we have the following 
theorem. 


Theorem 5.8: With yn; , X; defined as above, and 


e-1 
No = Mee + Dm X, , 


i=l) 


e-1 


(2) Xo = Me = > MeN . 


t=0 


(17) Set 


my) = mM + my + my? + +) + moay™, 


my)” = M + MY + m,-oY° 5 Oeiaa 3 my. 
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Then 
(i) fiestas f even, 
1 f odd. 
(it) m(y)m(y)” =1 mod y° +1, f even, 


e~-} 
= pS y + ye”, f odd. 


Theorem 5.3 has several interesting corollaries of which we mention 
one. 

Let w be the weight (the number of non-zero coordinates) of m(y). 
The following statements come from Theorem 5.3. 

The weight of X; = The dimension of (;) = f. 

The weight of 7; = The dimension of (X,;) = wf, f = 0(2), wf +1, 
f = 1(2). 


Corollary 5.4: If p = 2° — 1, (X;) is a (2* — 1, 2°") code, with mini- 
mum weight < k. 


Proof: For p = 2° — 1, we have f = k. Clearly the minimum weight 
in (X,) is bounded above by that of X, , which is k. 


The minimal ideal (7;) is the dual of a Hamming code. Hence », (and 
every other non-zero code word) has weight (p + 1)/2 = ef/2 + 1. 


Thus w = e/2, and the dimension of (X;) is (p + 1)/2. 


We can use Theorem 5.3 to discover some other remarkably poor 
cyclic codes; for example 

p = 251, e 

p = 1801, e 


I 
I 


16, f = 16, w 
12,5) = 25,0 


9, 
39. 


| 
I 


[After the completion of this paper, the writer discovered that Abelian 
Group Codes have also been investigated by Berman (KIBERNETIKA, 
vol. 3, no. 3, 1967) and by Paul Camion (to appear).] 


VII. CONCLUSION 


The writer regretfully admits that she has made no attempt whatso- 
ever to find out whether general Abelian group codes are of any practical 
value. One obvious thing to do is to make a computer search; the 
algorithm for finding the primitive idempotents is quite easy to imple- 
ment. Another direction of research is to look for a class of groups, not 
cyclic, which produce codes with some desirable practical properties. 
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APPENDIX A 


An Example of a Non-Cyclic Abelian Group 


Let G be the group of order 9 which is the direct product of two groups 
of order 3. The elements of G are 


2 2 2 2 2: S. 3 3 
1, 2,299; LY, 2; Y 2, CY, aay =1. 


Let @ be a primitive third root of unity over GF(2); then 


The matrix %(g) is: 




































































Wry? Yory2 
] 1 
2 
a a 
2 
Qa a 
2 2 
a a 
] a 
a 1 
a a 
2 
a 1 
2 
1 a 

















It is symmetric because the characters are written in the same order 
as the group elements to which they correspond; the argument does not 
use the symmetry of (9). 
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The trivial idempotents are 
Y,=1;Y,=¢+"%7;Y,=yty;Y¥,=ayt+2'y; Y; = avy +2y’. 

In order to find the primitive idempotents we need the multiplication 
table for the Y; . This also is symmetric and we write only half of it. 











Ys Ys Y, Ys 
be ¥3 > —_ >= 
Y; Y, = Y; Ys TTS . 
Vy Y; a Y; Y, + Y; Y, ae 
Y; Y;+ VY, Ysera Y2+ Ys Y; 


We have then 
1=Y,.+ 04 Y.); Y,; = Y:Y. + Y3(1 + Y2); 
(1+ Y;s) = (1+ Y3)¥2+0+ ¥;3)0 + Y2). 
Thus 
Y; = (Y¥a+ Ys) + (Y¥3 + Ya + Ys), 

1+ Y; = (ese Yaer Ys) FO ae aa ¥e 3G eV) 
1=Y,;+ (0+ Y;3) = (Yat Ys) + (Vs + Ya + Ys) 

Se Vg Mao Vig) de Ob epee dg ae Viet a) 
We multiply this equation by Y, and (1 + Y,): 

Y,= Ve Yack Yo FiWs +a Ys) +(Y.+Y.,+ Y;) +0, 
1+ Y,= (Ye+ Y3 + Ys) +O0+04+ 0 + Yat Ys + Yo + Ys). 
Finally, 
1= Y,+ (+ Y,) — (Yo + Ye + Ys) + (V3 + Ya + Ys) 

+ (Yo +¥e+ Ys) + (Yat Vet ¥) ++ ¥ot+¥et+ Yat ¥). 


This is a decomposition of 1 into five mutually orthogonal idempotents, 
which are therefore the primitive idempotents. Set 


A=Y2+Y,;+Y,=2e¢4+r74+yt+tytay42°y’. 
We use the table X(g) to check that 
W.(A) = W22(A) = p,(A) = Wy(A) 
Wey( A) = Wory2(A) = 1. 


Hence 


I 


W22,(A) = Wy:(A) = 0 


Yo+ Y¥3 + Yq 


64. 
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Similarly 
Y,+ Y,+ Y; = 4, 
Y.+Y¥,+ Y; = 4, 
Yo+t Y;+ Y; = 6, 
1+ Y¥.+Y,;+Y¥,+Y;,;=6. 


The matrix (@,,-.,,) for the trivial idempotent Y, is 

















1. ee yw xy’ yo ay ay? 

1 0 1 1 0 0 0 0 0 0 

z 1 0 1 0 0 0 0 0 0 
x 1 1 0 0 0 0 0 0 0 

y 0 0 0 0 1 1 0 0 0 
xy 0 0 0 i 0 1 0 0 0 
xy” 0 0 0 1 1 0 0 0 0 
y° 0 0 0 0 0 0 0 1 1 
xy” 0 oO O 0 oO 0 1 O 1 
xy? 0 oO 0 0 oO O 1 1 O 


To save space we write this as the Kronecker product 


1 0 0 011 b 0 0 011 
0 1 0) Xj]1 0 1) = 0 Bb OO], b= 1 
00 1 1 1 0 0 0 b 1 1 0 
and also write X(g) as the Kronecker product. 
od 11 ii a a a 1 1 ] 
la @vixX/la a@vl=fa aa aia , a=!l a a 
1 a a 1 o& a a@aa aa 1 a a 
a’ a’ a’ 
X(g"') = ja’ aa’ aa’ 
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where 
11 1 
a=|1 a@& a 
eae. 


It is then easy to calculate that 


aba’ O 0 
X(9™")(Ay;-15)X(g) = | 0 aba’ 0 
0 0 aba’ 
where 
P 0 0 
aba’ = 10 1 O}- 
00 1 


Theorem 3.4 then says that the non-zeros of Y, are 


We (W.)”, Ves (Wow) s Very (Wry) 


which is obvious from the array X(g). 

This is also an illustration, though a rather trivial one, of Theorem 4.2. 
H is the group (1, x, x”); K is the group (1, y, y’). A is the ideal 2 + 2° 
in FH, and B the ideal 1 in FK. The non-zeros of A are y, , ¥2 and 
the non-zeros of B are y, , ¥, , ¥; . Clearly Y. = AB, and the non- 
zeros of Y. are the products Py , as above. 

We can also check Theorems 5.1 and 5.2 from the following table: 


(= Va Vat Vase Ya Vz., 
O2 = Y, bas Les 
63 = Vere Yaar Vey 
6, = Ve Vek Va 

65 = Y,+ Y; + Y;. 


It is clear that 
Y; 6, + 82+ 63 + 0, + 6; , 


Y, = 05 + 6, + 6;, and so on; 
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and 


(mi;)” re 


oe aoe 
-_- -—& CO 
—- = = CO 
oon EE oo 


APPENDIX B 


An Example of the Product of Three Cyclic Codes 


Let H, K, L be cyclic groups of orders 3, 5, 7 respectively. Their 
direct product G is cyclic of order 105. (Unfortunately, this is the smallest 
possible example.) 


Write 
-_ 2, ae 23 4, ee 2 3 4 5 6 
AS 1, a5 | Ge Pa ae | Ode We e-ree ahe ae e e 


Let (A,) (8, 2) and (A.) (5, 4) be the single parity check codes in FH, FK, 
with idempotents 

Ap=at@; As=ytytyty* 
Let (A3) (7, 4) be the Hamming code in FL, with idempotent 

Azs=l+ete2?t 2%, 

The direct product code has idempotent C = A,A;A;3. (C) isa (105, 32) 
cyclic code, with minimum distance 12. Hach vector of C can be repre- 
sented as a three-dimensional array of ones and zeros, which are situated 
at the lattice points corresponding to 2z’y’z* in Fig. 1. (The origin 
is 2 y°z.) The lines of this array which are parallel to the x-axis are 
vectors of (A,); those parallel to the y-axis belong to (A.), and those 
parallel to the z-axis to (A;). 

It has been suggested (see Ref. 8) that an array like this be used 
for simultaneous burst and random error correction. It must however be 
borne in mind that such a code will be highly redundant. 

To express C' as a cyclic code we write the lattice points in order 
1, u, wu, wy +++ w'°*, where u is a generator of the cyclic group G, for 
example » = ayz. With this choice z‘y’z* becomes yu” where n is the 
least integer such that 


n—1t = 0(8); n— j = 0(5); n—k = 0(7) 
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76 
zZ5 
24 


z3 





Fig. 1—v*yiz*, 


for example: 


xyz = (xyz). 
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Delta Modulation Codec for Telephone 
Transmission and Switching 


Applications 


By R. R. LAANE and B. T. MURPHY 
(Manuscript received January 12, 1970) 


A highly integrable delta modulation codec design, for applications 
where transmission bandwidth is not at a premium but where an inex- 
pensive and high quality converter is desired, is considered in this paper. 
An asymmetrical codec integrator is used to improve quantizing noise 
characteristics. Charge parceling techniques which are used for performing 
the integrating function, offer advantage over conventional RC integrators. 


I, INTRODUCTION 


Delta modulation’ (AM) is receiving interest for voiceband analog- 
to-digital (A/D) conversion applications where coding efficiency is less 
important than the requirement for economical, but high quality A/D 
conversion, characteristics. One of the potential applications is in pulse 
code modulation (PCM) coding systems where the PCM code is formed 
by first converting the analog signals into a single bit digital code using 
per terminal AM coders. The AM bit stream is then converted into a 
PCM format using digital filtering.”"* The technique takes advantage 
of the simple means of providing A/D conversion with AM and utilizes 
highly integrable digital hardware for providing the AM to PCM 
conversion. 

Another promising application of AM is in space division switching 
networks. Analog inputs to the network are converted into a digital 
code by per terminal AM coders. This allows implementation of digital 
switching networks which are more ideally suited to integrated semi- 
conductor technology than analog networks. Requirements on network 
loss, signal distortion and crosstalk are significantly relieved. 

We describe the design of a single integration AM codec (coder- 
decoder) which shows promise of meeting the conversion requirements 
for both of the above applications. Improved conversion characteristics 
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are achieved using planned integrator asymmetry. For accurate coding 
characteristics at high clock (sampling) rates, charge parceling tech- 
niques are used in the integrator for reconstructing the anlaog signals. 
A highly integrable design which offers economical, per line A/D 
conversion is achieved. 


II. DELTA MODULATION CODING REQUIREMENTS 


2.1 Codec Operation 


A block diagram of a delta modulation codec, using single integration, 
is shown in Fig. 1. To perform the analog-to-digital conversion, an analog 
input is compared with a reconstructed version of itself from the coder 
integrator. The relative difference between the signals is translated into 
a single bit digital code by clocking the output of the comparator stage. 
The code is then transmitted to both coder and decoder integrators where 
it forces either a small positive or negative voltage change in the inte- 
grator output. Thus, a single bit code is used for controlling the inte- 
grator voltage and causes the integrator to produce a close track of the 
input signal. If a matched integrator is placed at the decoder, a similar 
track of the analog signal is recovered. 


2.2 Overload Characteristics 


Because the transmitted digital code contains information correspond- 
ing to the derivative of the message function, overload characteristics 
with delta modulation become a function of signal slope instead of am- 
plitude. The overload point occurs when the integrator is forced to 
produce a similar polarity voltage step during each clock cycle. Thus, 


CODER 
ANALOG 
INPUT ae. 


CLOCK 










DIGITAL 
OUTPUT 


COMPARATOR 


P 
| 
INTEGRATOR | 
OUTPUT 

INTEGRATOR Ma 
DECODER | 
| 
ANALOG al 

—wrerator J 


INPUT 


Fig. 1—Delta modulation codec (coder-decoder). 
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for a sampling frequency, f, , and an integrator step, co, the maximum 
integrator voltage slope is given by 


of. . 
The maximum slope of a sine wave of amplitude, A, and frequency, 
f, is 
QrfA, 

and therefore overload occurs when 

2QrfA = of, . (1) 
This sets the product of o and f, ; their relative values are set by quantiz- 
ing noise requirements for meeting signal to noise objectives. 
2.3 Quantizing Noise 


Granular quantizing noise has been the subject of numerous papers 
including works by Van de Weg,* Wang,’ and Iwersen.° The theory 
developed by Iwersen predicts a quantizing noise spectrum which is 
in good agreement with measured noise in delta modulators.’ To improve 
noise characteristics of our delta modulator, his theory is used for 
optimizing coding characteristics. Some of the basic equations and 
calculations which govern the design of our codec are reviewed in this 
section. A detailed description is given in Ref. 6. 

The effect of an asymmetrical integrator is shown in Fig. 2, where 
coding of an idle channel input or a de signal is illustrated. In this ex- 
ample, the positive integrator step, o, , is larger than the negative step, 
o_ ; that is, 


Oo, =ao--e, 
go~ = —o+e. 


As a result, a sawtooth error wave of peak-to-peak amplitude o is 
generated. The noise spectrum resulting from coding a steady-state 
input with unbalanced integrators consists of frequencies given by 


fi = | QUA — 8)/2If. | (2) 
where 
Q(a) = «a — N(@). 
N(q) is the integer nearest a, and 


ov = e/e. 
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TIME 


AMPLITUDE — 





Fig. 2—Integrator output for an asymmetrical coder, shown with |o,| > |c_l. 


The power at the frequency of index 7 is calculated from 
P, = 20°/n'l’. (3) 


The frequencies for which / is even are components of the sawtooth 
wave of peak-to-peak amplitude o and fundamental frequency df, . 
Additional, not so evident, sawteeth are also usually present. For ex- 
ample, components of a second order sawtooth are calculated by choos- 
ing values of 7 equal to aN, where a is a positive integer and N is the 
odd integer nearest 1/3. This sawtooth has a peak-to-peak amplitude 
of 2c/N ~& 2 and a fundamental frequency of | (1 — N#&)f,/2 |. 

It is possible to significantly reduce inband quantizing noise for low 
level inputs by using a planned integrator imbalance.*’*? However, to 
guarantee good noise characteristics, the imbalance must be maintained 
between a lower and an upper limit. The lower limit must allow an 
imbalance to force the fundamental component of the fundamental 
sawtooth wave above voiceband. The change (the spreading) of the 
noise spectral lines resulting from phase modulation of the idle channel 
spectrum by the input signal must also be considered. 

The upper limit on integrator imbalance is set by quantizing noise 
objectives. It is difficult to guarantee that the fundamental component 
of the second order sawtooth wave (of peak-to-peak amplitude = 2e) 
will be kept out of voiceband. Therefore, the magnitude of the inte- 
grator step imbalance, e, must be maintained at a level where it will not 
introduce excessive inband noise problems at low or quiescent input 
levels. 

Iwersen has calculated the quantizing noise as a function of signal 
level for various step imbalances using a 12-millivolt integrator step 
size and a 1.544-MHz sampling rate. The calculation is made for a 
broadband input and uses C-message weighting of the noise in the 
voiceband. Results are plotted in Fig. 3. The advantage of using an 
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f;5=1.544 MHz 
g=i2mvV 











NOISE IN GBrnc 














~50 -40 -30 -20 -10 re) 
SIGNAL IN dBm 


Fig. 3—Calculated quantizing noise versus average speech power for ? = 0, 0.02, 
0.05, and 0.10. 


asymmetrical integrator over a perfectly balanced integrator (8 = 0) is 
clearly evident for low level input signals. An optimal range of integrator 
imbalance falls into a range from 3 = .02 tod = 0.10. At higher values 
of } increased quantizing noise is produced at quiescent levels and at 
low signal levels. For lower values of @, the fundamental sawtooth fre- 
quency is close to voiceband and phase modulation of the spectral lines 
due to input signals causes excessive quantizing noise energy to fall 
into voiceband. 


2.4 Transmission Objectives 


Design objectives for this experiment* require that coding a 6-volt 
peak-to-peak (-+7 dBm into 9002), 1000-Hz signal will not cause slope 
overloading. This corresponds to a maximum integrator voltage slope 


* The design objectives should not be interpreted as official Bell System trans- 
mission objectives. They were established for this experiment as a guide line for 
providing a reasonable quality of voiceband conversion. 
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capability of 
of, 19 X 10° volts/second. 


Additional design objectives include a 50-dB dynamic range with 
a signal-to-noise ratio better than 25 dB at the —43-dBm level, in- 
creasing to better than 40 dB at the +7-dBm level. Quantizing noise 
of idle channels (zero input) should be maintained below 16 dBrn using 
C-message weighting. Design objectives for gain (loss) variation, over 
a 200-Hz to 3200-Hz frequency bandwidth, require an average 0.5-dB 
loss with loss variation maintained within +0.25 dB. 


III. CODEC CIRCUIT DESIGN 


The relative simplicity of the delta modulation coding function makes 
feasible the realization of highly integrable and inexpensive codec con- 
figurations. In this design, the codec circuitry utilizes a combination of 
integrated semiconductor and thin-film capacitor techniques and relies 
heavily on the characteristics of the two technologies—excellent match- 
ing of device characteristics on an integrated circuit (IC) chip and close 
ratio tolerances between capacitors on thin-film capacitor arrays. These 
characteristics are of special importance in the key element of the codec, 
the integrator network, where a high degree of precision is needed for 
accurately reconstructing an analog signal from a digital input signal. 


3.1 Charge Parceling 


The integrator utilizes a charge parceling circuit (sometimes called 
the “‘bucket and dipper” circuit), shown in I'ig. 4, to provide the digital- 
to-analog conversion function. The prime advantage of the charge 
parceling approach is that it relies only on capacitor ratio tolerances 
for making the digital-to-analog conversion rather than on absolute 
resistor and capacitor tolerances required by the more commonly used 
RC type integrators. Timing problems are also not critical, provided 
sufficient time is allowed for charging and discharging small charge 
parceling capacitors in the integrator network. 

To add a voltage step to the integrating capacitor, C, , (Fig. 4) the 
+1 input (clock input) is applied to produce a positive voltage step 
of AV at the C,, capacitor terminal. This causes an equivalent change 
in voltage on both of the capacitor terminals until the increase exceeds 
the threshold of 7. As 7, turns on, charge is dumped from C,, to C;, 
producing a voltage step on C; proportional to the two capacitors 


Ca , 
AV; = la Sta lav (4) 
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+1 INPUT 
(CLOCK) 





-2 INPUT 
(DIGITAL INPUT) 


Fig. 4——Charge parceling circuit for AM integrator. 


where AV, is the voltage change on C; and AV’ is the voltage change at 
the C,, terminal after 7’, begins to conduct. 

As the input returns to its original level, a negative voltage step of 
AV is forced on the C,, terminals. This change causes T, to become re- 
verse biased and turns 7’, on to recharge the C’,, capacitor. The recharge 
voltage for C,, is governed by the compensation network which trans- 
lates a voltage V., approximately equal to the integrating capacitor 
voltage V, , to the base of 7, . Thus, C,, is recharged to V, minus the 
base to emitter drop of 7, , and the net increase in integrator voltage 
due to a +1 input is 


C 
ie =_ — Varr, — Veen — — JV. (_-&-)- 
o4, = AV; = [AV BET, pier, — (Vr — V,)] G4. 


(5) 


Note that the effect of the junction capacitances of 7, and T, also must 
be considered in the step size calculation, but the effect is secondary 
and is not described in detail here. Equations including junction capaci- 
tances are given in the appendix. 

Changes due to temperature in the threshold voltages of Vgz,, and 
Vezr, are compensated by controlling the amplitude of the voltage 
step (AV) with a de voltage V, and two matching base to emitter (diode) 
voltages Vp. Therefore, the effective +1 integrator step can be rep- 


1020 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1970 


resented as 


oo = [V, + 2V_ — 2V an — (Vr — valle S62): ” 


To produce a negative step to the integrator the —2 input (digital 
input) is activated. This causes a voltage change on C_, similar to the 
change produced on C,, by the +1 input. However, charge is removed 
from C, during the negative slope of AV and is dumped from C_, to 
the —V supply during the positive slope of AV. The effective voltage 
step produced at the integrating capacitor by this input is given by 


o-2 = [V, + 2V_n — 2Vun — (Vi — voi) (7) 
where ay is the common base current gain of 73. To minimize the 
effect of ap variation, a Darlington transistor pair is planned as a re- 
placement for 7’; in future models. 

In the delta modulation codec, a clock controls the +1 step input to 
the integrator, and consequently a o,, step is added to the integrator 
during each clock cycle. The digital input to the integrator controls 
the —2 step input. When present, it decreases the integrator voltage by 
o-2. Therefore, whenever a digital input is applied, the net change in 
the integrator voltage is ¢,; — o_, . When the digital input is not applied, 
the clock automatically raises the integrator by a o,, step. 

To optimize quantizing noise characteristics, the integrator asym- 
metry is designed for 





0.02 <3 < 0.10 
or, translated to the integrator step requirements (for the case co, > c_) 


2 — 1.89 + 0.07. 


+1 

An additional feature of the charge parceling integrator is that gain 
(or loss) between coder and decoder integrators can be easily adjusted. 
Gain can be adjusted either by changing the step generator voltage, 
V,, between integrators or by using a different ratio of integrating to 
charge parceling capacitors on the coder and decoder integrators. The 
first technique might be useful as a form of automatic gain control which 
can be adjusted as a function of voltage. The second technique would 
be useful when a predetermined amount of gain (or loss) is desired. 


3.2 Compensation Network 


To prevent the decoder output from drifting to either a maximum 
positive or negative output voltage as a result of differences in the plus 
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and minus steps between the coder and decoder integrators, the decoder 
integrator must compensate to automatically adjust its step imbalance 
to match the coder integrator step imbalance. Compensation is not 
needed in the coder integrator but is added to help match coding char- 
acteristics between coder and decoder. 

With the charge parceling circuits described, compensation is achieved 
by adjusting o,, of the decoder integrator as a function of the integrator 
voltage level. The step size is varied by adjusting the recharge voltage 
for the C’,, capacitor. 

Figure 5 shows a schematic of the compensation network. A nearly 
linear compensation as a function of integrator output voltage is 
achieved by this configuration. Good reproducibility of compensation 
characteristics between integrators is also possible. Operation is as 
follows. 

With no current through R, , a voltage equivalent to V; is translated 
to the base of 7, for recharging the +1 step capacitor, C,,. As V; 
increases above this level, V¢ also increases but begins to lag farther 
and farther behind V, , because #, can no longer supply all of the current 
required by the current source J, . The balance of the current is supplied 
through R,, and the voltage drop across R, determines the difference 
between Vz, and V;. 

The opposite happens as the integrator voltage drops below the bias 
point set by R, and J, . Then R, will supply more current than accept- 


Ve 





Fig. 5—Integrator step compensation circuit. 
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able to the current source J, , and the excess current will be forced to 
flow in R,, in this case keeping Vg more positive than V;. Thus, the 
amount that C’,, is recharged after each voltage step becomes a function 
of the integrator voltage level, decreasing with increasing integrator 
voltage and increasing with decreasing voltage. The net effect of com- 
pensation on the +1 step size can be expressed as 


=-1 
O41 = O41 — av,(1 = 7a) (= | (8) 
where o,, is the +1 step at V; and o’, is the +1 step at V; + AV,. 

Gain (loss) variation between coder and decoder is controlled by 
reproducing the c_, step size. This step size becomes primarily a function 
of the de step control voltage, V, , the ratio of C_, to C; , and the com- 
mon base current gain of the negative step charge parceling transistor. 
To meet gain (loss) variation requirements, a +1 percent tolerance is 
needed for both the step voltage and the capacitor ratio. 

The compensation network automatically adjusts o,, in the decoder 
to match the o,, to ¢_, ratio between coder and decoder. Since a small 
current is derived from C; as bias for the output buffer stage (7, and 
T’.), a drain is produced on o,, . Variations in bias current between coder 
and decoder cause a difference in the effective o,, step size but are 
counterbalanced by the compensation network. To minimize drain 
from C;, a high impedance connection is made at the analog output 
terminal from the integrator. 


3.3 Codec Building Blocks 


Figures 6 and 7 are schematics of the comparator and the integrator 
circuits, respectively. A combination of a comparator and an integrator 
are required by the coder; only an integrator is required by the decoder. 
Figure 8 is the block diagram of a complete codec. Thus, a codec is 
implemented from 61 transistors and diodes, 54 resistors, and 6 capaci- 
tors. Transistors, diodes, and resistors are fabricated by IC techniques, 
the capacitors by thin-film techniques. 

The comparator stage (Fig. 6) compares an analog input with an 
integrator output signal. Their difference is amplified by approximately 
200 and is passed to a latching circuit (7's, T> , T12). The latch holds 
the state of the comparator output for the duration of the clock pulse 
and allows conversion of the output to a single bit digital code by the 
clocked output gate (7,,, 71:). To avoid a race condition cLocKl 
input is made to overlap the pulse from cLock2 input. 

The integrator network (Fig. 7) converts the clock and digital inputs 
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Fig. 6—AM comparator. 


at the integrator into controlled amplitude voltage steps by first am- 
plifying the input pulses (7'.) and then generating a voltage step (73 , T's) 
to the charge parceling circuit. The step is controlled by a voltage 
control (V, = 1 volt) plus two temperature compensating diodes 
(T; , T’,). The charge parceling circuit uses 33-pF and 69-pF capacitors 
for the +1 and —2 steps and a 2690-pF capacitor for the integrator. 
This allows approximately a 12-millivolt integrator step voltage. Note 
that the ratio of the charge parceling capacitors represents only a 
portion of the integrator step imbalance. The effective size of the positive 
step is decreased by output stage biasing requirements and by the 
effect of parasitic capacitances in the charge parceling network. The 
compensation network is designed to provide approximately 2 percent 
of step size compensation per volt change in the integrator output level. 


IV. CODEC OPERATION 


Tests were performed on codecs fabricated from discrete beam-lead 
resistors and transistors on ceramic substrates. Figure 9 shows a photo- 
graph of the codec ceramic. The ceramic contains the comparator and 
the two integrators needed for a complete codec. Discrete capacitors 


1024 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1970 


O+6V 





O 
+1 INPUT 
(CLOCK) 


—2 INPUT 
(DIGITAL 
INPUT) 


+6V 0 -6V0 INTEGRATOR OUTPUT 


Fig. 7—AM integrator. 


were used for the charge parceling circuit and were externally mounted 
to the model. 

Measurements were made with a 1.544-MHz sampling frequency and 
a 12-millivolt integrator step size. Integrator imbalance was set at 
dv = 0.10. Figure 10 shows typical signal-to-noise characteristics of the 
codec for a 3200-Hz input signal. Similar characteristics are observed 
for other input frequencies. Signal-to-noise ratio is well above transmis- 
sion objectives. Quiescent quantizing noise is measured at less than 12 
dBrnC. 


Dips in the signal-to-noise curve are caused by the sawtooth error 
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Fig. 8—Interconnecting AM comparator and integrator blocks to form a codec. 





Fig. 9—Discrete beam-lead resistor and transistor model of AM codec. 
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Fig. 10—Measured AM signal-to-quantizing noise ratio for a 83200-Hz input signal, 
fs = 1.544 MHz, quiescent quantizing noise = 12 dBrnC. 


waves generated by the integrator step imbalance.’ The largest dip in 
the curve (Fig. 10) occurs when the input signal slope and the funda- 
mental sawtooth slope are approximately the same magnitude. The 
input signal will reduce the effective slope (and therefore increase the 
length) of the fundamental sawtooth; and as the sawtooth frequency is 
reduced to voiceband frequencies, additional noise begins to appear in 
the voiceband. A worst-case condition is encountered when the resulting 
fundamental sawtooth error wave has been reduced to the midband 
frequency of the voice signals. For the 3200-Hz input signal this cor- 
responds to approximately a —23-dBm (900-Q) input signal. Additional 
smaller dips in the signal-to-noise curve are caused when higher order 
sawtooth waves are forced into voiceband; however, their effect is not 
significant. Thus, signal-to-noise problems are avoided provided that 
the integrators contain a sufficiently high unbalance to keep the funda- 
mental sawtooth frequency from the voiceband until high signal levels 
are coded. | 

A number of codecs have been fabricated and tested using discrete 
beam-lead devices. All have shown similar signal-to-noise character- 
istics. Gain (loss) variation has been maintained within 0.2 dB. Power 
dissipation is approximately 250 milliwatts per codec. 

Design of an integrated version of the codec has also been completed 
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Fig. 11—Four integrated AM codecs using thin-film charge parceling capacitors. 


and preliminary tests have indicated satisfactory codec operation. 
The codec is fabricated from three IC chips—one comparator chip and 
two identical integrator chips. Figure 11 shows a ceramic substrate 
containing four complete codecs. Thin-film capacitors are used in the 
charge parceling circuits. To relieve fabrication tolerance requirements, 
the integrated circuit models use approximately twice the capacitor 
values of the discrete model. 


V. CONCLUSIONS 


Delta modulation techniques offer highly integrable and economical 
analog-to-digital conversion elements. We have described a delta modu- 
lation codec design suitable for voiceband applications where conversion 
economy and quality are more important than the transmission band- 
width requirements. 

Techniques are utilized in the design to force a large portion of 
quantizing noise out of voiceband by using a controlled step imbalance 
in the integrator. Excellent noise characteristics are achieved. The design 
attempts to take advantage of integrated circuit techniques and thin- 
film capacitor techniques by relying on matching of device character- 
istics and on accurate capacitor ratio tolerances for reproducing coding 
characteristics. Gain (loss) variations have been maintained within 


1028 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1970 


+0.2 dB using codecs fabricated from discrete beam-lead transistors 
and resistors. 
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APPENDIX 


Charge Parceling—Effect of Junction Capacitances on Integrator Step 
Size 

The +1 step portion of the charge parceling circuit, including transis- 
tor junction capacitances, can be represented as shown in Fig. 12. 
For AV’ S Voz, + Vuz,, the circuit is approximated by Fig. 18. 
Before 7, beings to conduct, AV’ must increase by Vgz, + Vaz, from 
its initial value. This requires a voltage swing of AV” at the input, where 
AV” < AV. The magnitude of AV” is given by 


(Vez, + Ver,) (Crs, + Crp, + Crn,) = (AV”’ ae Vor, = Ver JC a1 
or 
AV" = (Vaz, + Veet = See Eee one Con), 


+V Cte, 










Crs, Te 


C41 


av t 2 
Vi + Vee. ee 


VI - VBE, 


a. CTS. sar 


Fig. 12—-+1 step portion of the charge parceling circuit. 
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Fig. 13—Approximated circuit (+1 step) for AV’ S Vez, + Var,. 


Thus, charge is dumped on C, only for a portion of the input swing, 
AV — AV”. After T, conducts, the equivalent circuit becomes the one 
shown in Fig. 14, and 


(AV ord AV” rae AV)C 41 a AVC, 


or 


AV, = (AV — AV") atts 
I 241 


and 
O11 = AV, aoe [av _ (Vaz, er V pre) 


: Crs, + Cre, + Gon.) Cay 
( a Cr “++ Cei 


+1 


Note that this does not include the change in the effective +1 step size 
due to biasing current requirements of the output buffer stage. 

A voltage step is also produced on C; due to Cx, ; however, the 
positive and negative portions are essentially the same and the effect 
is self-canceling. The —2 step portion of the charge parceling circuit 
is given by Fig. 15. For AV’ S Vaz, + Vaz, , the circuit is approximated 
by Fig. 16. 

Before T, conducts AV must change AV’ by Var, + Vaz, . The 


Cay 


AV- wif — au 


NEGLECT 





Cts.+ Cte CI 
De 


Fig. 14—Equivalent circuit for +1 step charge transfer. 
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Fig. 15——2 step portion of the charge parceling circuit. 


magnitude of this voltage change at input, AV”, is given by 
(Var, ai Viz.) (Crs, + Corn, + Cre.) — (AV” — Vor, _ Veu,)C-2 
or 


AV” = (Var, + Veal ab Crs, + Cre, + Crs, Ge = Crm), 


Thus, charge is drained from C; during the voltage swing AV — AV”. 
After T, conducts, the equivalent circuit is as shown in Fig. 17, and 


AV, = (AV — AV") a ar 
T 2h = 2 


or 


g-2 = AV; = | av =AVex, + Vax) 





(1 4 Crs, + Con, + Gre.) a625. * 
C_2 C; = a2C_» 


C-2 


AV 
aia Cts, + CTE, + CTE, 


Fig. 16—-Approximated circuit (—2 step) for AV’ S Vaz, + Vaz,- 
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Fig. 17—Equivalent circuit for —2 step charge transfer. 
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On the Performance of Digital Modulation 
Systems That Expand Bandwidth 


By V. K. PRABHU 
(Manuscript received December 19, 1969) 


It ts well known that protection against additive gaussian notse can be 
obtained in m-ary digital modulation systems by expanding bandwidth 
or by increasing the channel signal-to-noise ratio. It is also well known 
that arbitrarily small error probabilities can be attained in digital systems 
by using long and complex encoding and decoding procedures. Based on 
the resulis of Shannon and Slepian, we derive for an optimal system lower 
bounds to the channel signal-to-noise ratio for various probabilities of error, 
for vartous bandwidth expansions, and for a processing interval not greater 
than the signaling interval of the source. It is assumed that all m characters 
have equal a priort probabilities and that maximum Itkelihood detection 
is used in the receiver. For a bandwidth expansion of two, and for equal 
energy code words, we also show that the performance of a coherent phase- 
shift keyed system is as good as that of the optimal system. 


I. INTRODUCTION 


Various m-ary digital modulation schemes [such as coherent phase- 
shift keying (CPSK), differentially coherent phase-shift keying 
(DCPSK), frequency-shift keying (FSK), and others] are currently 
under investigation for use in satellite, terrestrial, and other radio 
communication systems.’'?"* In such systems, the transmission channel 
is noisy, and bandlimited, and one is interested in finding an optimum 
form of modulation for the transmission of information from one point 
to another. By optimum form of modulation, we mean that we would 
like to transmit (with a given error rate) as much information as possible 
in a given band of frequencies and for a given amount of (channel) 
signal power. 

The complexity of the equipment required for particular kinds of 
modulation, or other considerations in the system, may rule out these 
optimum transmission schemes in favor of simple and suboptimum 
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schemes of modulation and demodulation. In order to compare the 
performance of these simpler practical modulation systems with that 
of the optimal systems, it is first essential to investigate the performance 
of these optimal systems. 

In this paper we shall assume that the noise in the channel available 
for communication is gaussian and has a uniform power spectral density 
over all useful frequency bands. [In terrestrial systems, in the frequency 
bands above 10 GHz, close spacings of the repeaters are almost always 
mandatory for reliable communication (during fading conditions pro- 
duced by rain).” If low noise receivers are used in the system, it is pos- 
sible that the total interference power (due to co-channel and adjacent 
channel interferers) received by the system may be very much larger 
than the (thermal) noise power in the system.° In this case, note that 
the total noise corrupting the channel may not be assumed gaussian 
in all modulation systems (especially if the number of interferers is 
small).***] 

It is well known that protection against (additive) white gaussian 
noise can be obtained in m-ary digital modulation systems by expanding 
bandwidth and/or by increasing the channel signal-to-noise (power) 
ratio. In fact, Shannon has shown that it is possible to transmit with 
arbitrarily small error probability the output of a discrete source of 
entropy # over a channel of bandwidth W perturbed by additive white 
gaussian noise of average power N by using signals of average power 
S provided FR is less than W log, (1 + S/N) b/s.°"’ However, such a 
transmission scheme may involve long and complex encoding and 
decoding procedures, and to attain these low error rates it may be 
necessary to provide large storage (or long delay) in the transmitting 
and receiving equipment. Practical modulation systems presently used 
for large scale communication do not in general have such large storage 
capabilities or unlimited bandwidths. Also, we must note that there 
are practical limitations on the average power of a transmitter and the 
power that can be received by a receiver. 

Since most of the practical modulation systems have a certain band- 
width expansion ” and since bandwidth expansion usually improves 
the (noise) performance of the system, we shall now investigate the 
optimum performance of digital modulation systems that have a (chan- 
nel) bandwidth expansion n, a finite channel signal-to-noise power 
ratio S/N = p’, and a processing interval which cannot exceed one 
signaling interval T' of the source.* The message source is assumed to 


* It is assumed that the time interval over which the channel is used in decoding 
one message symbol cannot exceed 7’. 
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be of bandwidth W, , and the channel bandwidth is assumed to be IV. 
Further, it is assumed that all m characters (the output of the discrete 
message source consists of m different characters or symbols) have equal 
a priort probabilities and that the characters generated by the message 
source are statistically independent of each other. We also assume that 
a maximum likelihood detection scheme is used in the receiver. 

For such a system, we shall evaluate the lower bounds to the character 
error probability P(m, n, p”) of the optimal system so that we can com- 
pare its performance with that of any practical modulation system. 
For a given error rate, the difference between the signal-to-noise ratio 
required by the optimal system and that required by the practical system 
will then be a measure of the quality of performance of the practical 
modulation system. 

Here, we would like to note that our approach is identical to that of 
Slepian in Ref. 8 in which he evaluated upper and lower bounds to 
P(m, n, p”) for n odd, n = 5, and m = 128 (numerical results form = 
32, 64, and 256 can also be found in an unpublished memorandum by 
Slepian’) when 1 = P(m,n, p’) = 107°.* In addition to giving numerical 
results when P(m, n, p’) < 107° (error rates as low as 10~° are desired 
in some digital modulation systems’) for m = 2, 4, 8, 16 and 32, we 
give a method of evaluating upper and lower bounds to P(m, n, p”) 
for all values of n = 2. We also point out the special significance of 
n = 2, and give closed form solutions for the lower bound when n = 
2, 3 and 5. 


II. COMMUNICATION SYSTEM MODEL! 


The m-ary digital modulation system that we shall consider in this 
paper is assumed to have a signaling interval 7’. Since we assume Nyquist 
rate signaling, we shall assume that 

1 

= ON,” (1) 
where W, is the bandwidth of the message source. Eivery T seconds, 
the message source generates one of m characters or symbols. Since the 
characters generated by the message source are assumed to be sta- 
tistically independent, the entropy R of the message source is given by 
R = 2W, log, m b/s. (2) 


T 


* Note that Slepian uses P(m, n, p?) in determining the threshold in analog modu- 
lation systems that expand bandwidth. 

t+ Compare the communication system model that we discuss in this paper to 
that given in Ref. 8. 
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If S is the average power in the channel of bandwidth W, it follows 
from Ref. 6 that it is possible to transmit with arbitrarily small error 
probability if and only if 


2W, log. m S W logs (1 + S/N) (3) 


when there are no restrictions on the way in which the transmitter and 
receiver operate. 
Equation (3) can be shown to yield 


S/N =m" — 1 (4) 


where 
n= ‘a = 2TW = the bandwidth expansion factor. (5) 
0 


The lower bound to S/N given by equation (4) is then the smallest 
signal-to-noise ratio required to transmit (with arbitrarily small error 
probability) an m-ary digital signal through a channel of bandwidth 
expansion » when there are no restrictions on the transmitting and 
receiving equipments. This lower bound to the signal-to-noise ratio 
S/N is shown in Fig. 1. 

In the communication system model we are considering in this paper, 
there is no provision for the storage of a large number of characters, 
and hence it is to be expected that we will need signal-to-noise ratios 
much larger than those given in Fig. 1 (when arbitrarily low error rates 
are desired). Since the processing interval of our communication system 
is assumed not to exceed one Nyquist interval (corresponding to the 
message source), the channel signal corresponding to time 7’ can be used 
to decode one and only one message symbol.* If the bandwidth of the 
channel is W, the channel signal can be completely specified by samples 
taken every 1/2W seconds. In the Nyquist interval 7, there are then 
n = 2WT channel samples. We are then assuming that n channel 
samples are used to decode one message character, or that each of the 
m message symbols are mapped into a channel vector having n com- 
ponents.* (In the error-free transmission scheme of Shannon, ¢, 4 = 1, 
successive message symbols are mapped into one channel vector. By 
making ¢ sufficiently large, by choosing the m’ channel vectors appro- 
priately and by decoding appropriately at the receiver, the results given 


* This is equivalent to saying loosely that the communication system does not 
have storage capability for more than one Nyquist interval.® 

t There are certain subtle points involved in this assumption. Some of these points 
and their implications are discussed in Refs. 10 and 11. 

+ That is, we construct a dictionary that associates with each of the m message 
symbols a particular n-dimensional channel vector. 
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Fig. 1—Lower bound to the signal-to-noise ratio for different bandwidth expansions 
and ideal signaling. 


by Shannon may be obtained.* It is to be noted that it is essential to 
store £ message characters before we can generate the appropriate 
channel vector. In our scheme of transmission, we put ¢ = 1, and in- 
vestigate the optimum performance of the system.) 

As far as the average power in the channel is concerned, the channel 
vectors can be chosen in a variety of ways.’” Since all the characters 
are assumed to have equal a priort probabilities, we will make the 
assumption that all of them have the same average power S (or the 
same energy S7’).* 

Since the noise corrupting the channel is assumed to be white, each 
component of each channel vector is perturbed independently by the 
addition of a gaussian variate of mean zero and variance N. 


III. EVALUATION OF PROBABILITY OF ERROR P(m, n, p') 


Since the channel vectors corresponding to different message symbols 
have the same average power S, all these n-dimensional vectors ter- 


* Other types of restrictions (such as maximum power, maximum average power, 
and so on) can also be put on the signal vectors to analyze the communication system 
given in our paper. Since all symbols are assumed to be equally likely, we do not 
consider a system in which there can be unequal distribution of power among different 
channel vectors. In particular, some amplitude modulation systems (such as single- 
sideband AM) do not satisfy the requirement that channel vectors corresponding 
to different message characters have the same average power S, and hence such 
systems are not covered in this paper. 
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minate on the surface of a sphere of radius (nS). By choosing the chan- 
nel vectors appropriately, and by using maximum likelihood detection 
receiver, it can be shown” that the minimum probability of error 
P(m, n, p’), averaged over all symbols, satisfies the inequalities 








2 yy 2 S 
Q(m,n, p’) = P(m,n, p') < Q(m,n, p'), & = 7, (6) 
where?” 
O(n, m, 6) = L(Oain om 0°) @ 
Q(m,n, p) = U(0... nN, p n (8) 
Om,n 
sn 2 
27 sin ey (o=1 1 
= > ae —~ 4£sin? Om,n 9 19 ’ 9 
ve | sin”-” p du ©) 
0 
x(., nN, p n) = | pr(r) dr, (10) 
6 
u(6,n, p n) = 1(6,n, p m) ao [ D(A) Q(A) ar, (11) 
ee aaa sees 
QA) = r(* " i i. sin” ~ udp, (12) 
2 
T'(k) = / ea" dx, (13) 
0 


(n — 1) exp (—o? Bsin’ r) sin” ” 
2”? ret + 1) 


fr r* exp | —& = ela! cos 0 — ota) — Nf) dr, (14) 


and I,(a, 8) is the incomplete beta function given by 


Did) = 


Le.) = awl Ma - od, OSeS1; U8 


Ba, B) = | A =f dik (16) 
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The significance of the inequalities in equation (6) may be explained 
as follows. No m-ary digital modulation system with a given bandwidth 
expansion 7 and a given signal-to-noise ratio p’ can achieve a lower 
probability of error than that given by Q(m, n, p’).* Also, we observe 
from equation (6) that m-ary digital modulation systems can be built 
to have an error probability given by Q(m, n, p’).* 

The bounds given by equation (6) can, therefore, be used in com- 
paring the performance of practical modulation systems with that of 
the optimal systems, and in estimating the quality of performance of 
the practical modulation systems. For a given probability of error, and 
mand n, we can compare the signal-to-noise ratio required for a practical 
modulation system with the minimum signal-to-noise ratio predicted 
by equation (6) for the optimal system.? 

Since we are interested in this kind of comparison and since there 
seem to be theoretical considerations which show’ that Q(m, n, p’) is a 
very weak bound [it is shown in Ref. 10 that some explicit codes can be 
constructed to make P(m, n, p”) very close to Q(m, n, p’)], we shall not 
discuss the upper bound Q(m, n, p’) any more in this paper. 


IV. EVALUATION OF LOWER BOUND Q(m, n, p') 


For the sake of comparing the performance of proposed modulation 
systems with that of the optimal systems, it is essential to evaluate the 
lower bound Q(m, n, p’) for different values of m and n. The evaluation 
of Q(m, n, p’) is rather difficult and is usually done by using a digital 
computer. Slepian® has given methods of evaluating this bound when 
n is odd, and numerical values are given for Q(m, n, p) when 1 = 
Q(m, n, p) 2 10°’, m = 82, 64, 128 and 256, and n = 5. 

Since error rates of less than 107° are desired in digital systems and 
since no numerical results are available when m < 32 (in general, it is 
easier to build digital modulation systems with low values of m), we 
shall give some further numerical values for the cases considered by 
Slepian. In addition, we shall give a method to evaluate Q(m, n, p’) 
when 7 is even and point out its special significance when n = 2, 

First we review briefly Slepian’s method of evaluating Q(m, n, p’).’° 
For any given values of m and n, we evaluate by interpolation the value 
Of Om.n(O < On,» & 7/2) from the set of tables for J,(a, 8) given in Ref. 


* Of course, we assume that the digital modulation system satisfies other require- 
ments given in this paper. 
For n = 1 or 2, it can be shown that we can make Q(m, n, p?) = P(m, n, p?). 
Also, for all (integral) n, and m = 2, we can make Q(2, n, p?) = P(2, n, p*). 
* In making this comparison, we have to assume that we can estimate the band: 
width expansion factor n for the practical modulation system. 
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13. Since it has been shown” that 
L(6,n, 0°) = L(6,n — 2,0”) + cos 6G(6,n — 2, 0°), n> 8, (17) 


oS : (18) 


G(6,n, 0) = o cos Osin 6b,G(6,n — 1, 0°) 





+ i sin? 66(0,n — 2,07), n> 2, (19) 
n—2 
On = TT Ont n> 2, (20) 
and 
b, = 2", (21) 
2 
be —_ x ’ (22) 
G(6, 1, 0°) = 4 exp (—o’ sin’ 6)[2 — erfe (c cos 8)], (23) 
G(0, 2, 0°) = =sin 6 exp (—o*) + “Jsin 6 cos 0G(0, 1, 0°), (24) 
L(6, 3, 0°) = 4 erfc (c) + cos 6G(6, 1, o°), (25) 
where 
arte (a= S i exp (~P) dt = 1 — erf (2), (26) 


Q(m, n, p’) can be evaluated for odd n from equations (7), (9) and (17) 
through (25). However, for even n, we cannot use Slepian’s method of 
evaluating Q(m, n, p’) unless we can find an explicit expression for 
L(6, 2, 0°). 

Now it has been proved’*’**’”* that moderately high values of n(2 < 
n < 5) are required for some digital modulation systems in order to 
optimize transmission rates per unit bandwidth. Since we would like 
to compare these systems (and other systems with similar bandwidth 
expansions) with the optimal systems, we shall first express Q(m, n, p”) 
explicitly for odd n, and n S 5 before we discuss the evaluation of 
Q(m, n, p’) for even n. 


4.1 Lower Bound Q(m, n, p”) for n = 3, 5 


For n = 8, equations (7), (9), (10), (18) and (14) can be shown to 
yield 
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Q(m, 3, p°*) = : (eric [o(3)*] + (1 — 2/m) 


seep | -%2 (1 - 4)| {2 = pnel | oa’ (1 = 2)}}). (27) 


For m = 2',1 S ¢ S 5, we have evaluated Q(m, 8, p”) and have shown 
the results in Fig. 2. 

Let us now compare the performance of an FSK system (using square- 
wave modulation, ideal discrimination detection with an integrate- 
and-dump circuit as the post-detection filter) with a bandwidth expan- 
sion of 3 with the performance of the optimal system. For the FSK 
system the symbol error probability P» sx can be shown’’’” to be given 
by 


cot {r — 2 
Pose ™ pies AE Me es - exp | 20" sin? iz nA cee 2 | , 


oo E n— 2 4m—1 
cos | = 
2m—1 


For n = 38 and m = 4 and 8 we have plotted Pysx and Q(m, 3, p’) in 
Fig. 3. Noting that the error rate of the optimal system can be made 
close to Q(m, n, p’) for small n, it can be observed from Fig. 3 that the 
error performance of the FSK system is inferior to the optimal system 
by several decibels. However, note that the formula in equation (28) 
is an asymptotic formula and that we have calculated the bandwidth 
expansion factor for the FSK system by Carson’s rule. Also, note that 
we have taken the bandwidth of the message source to be 1/27’, where 
T is the signaling interval of the source. If all these assumptions are 
reasonable, we must conclude from Fig. 3 that the performance of the 
FSI system is far from being optimum. 

Let us now consider n = 5. For n = 5, we have 


Q(m, 5, p’) 








p>1, n< m+. (28) 


1 1, dp sin’ 6 cos’ 6 
=5  exfe [o(8)') + gr exp {-'8] 


a 


+ 4 cos 6 exp {—p $sin’ 6}(sin’ 6 + 5p cos’ Osin’ 6 + 2) 


-{2 — erfe [p(3)? cos an | ; (29) 


1042 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1970 








ces RAE 
RLhie aie 


-2 e) 2 4 6 8 10 12 14 16 18 20 22 
SIGNAL-TO-NOISE RATIO IN DECIBELS 


Fig. 2—Lower bound Q(m, n, p?) for n = 38. 
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Fig. 3—Comparison of the FSK system with the optimal system, n = 3. 
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where 


cos @ 


Seanell A acd (2 -1)| 
2 cos E + 3 COS | 1}, (80) 


and 
sin @ = +(1 — cos’ 6), (31) 


Form = 2,1 S$ ¢ S 5, we have plotted Q(m, 5, p”) in Fig. 4. 

For n odd, and n > 5, derivation of an expression for Q(m, n, p’) 
becomes rather tedious and long, and we shall not give these expressions. 
However, for n = 7, 9, 11, 13 and 17, we have calculated Q(m, n, p”) 
form = 2',1 <$¢ S 5, and the results are given in Figs. 5, 6, 7, 8 and 9. 
These numerical results which add to the results given by Slepian were 
obtained by using his method (see Appendix A for an alternative 
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Fig. 4—Lower bound Q(m, n, p?) forn = 5. 
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Fig. 5—Lower bound Q(m, n, p?) forn = 7. 


4.2 Lower Bound Q(m, n, p’) for n even 


Since we can calculate Q(m, n, p’) from L(6, n, o°) and since the re- 
currence equation relates L(0, n, 0”) to L(@, n — 2, o°) [see equation 


(17)], we can calculate Q(m, n, p’) for even nif we can calculate L(8, 2, 0”). 
It can be shown that 


1(0, 2, 0°) = | pald, 0°) ad (32) 
7] 
where 
p(d\,o) = * [e°" ++ o(m)? cos de” * {1 + erf (c cos A)}]}. (33) 


Noting that p.(d, o”)/2 is the probability density of the phase angle 
of a sinusoidal carrier of amplitude (2A)* corrupted by random gaussian 
noise of average power N (signal-to-noise ratio o” = A/N), we have 


DIGITAL MODULATION SYSTEMS 1045 

















NN \ao 
AAW ET 
ott 


-6 -4 =-2 












107! 








SIGNAL-TO-NOISE RATIO IN DECIBELS 
Fig. 6—Lower bound Q(m, n, p?) forn = 9. 


shown in Appendix B that L(6, 2, o”) can be calculated for any 6. Hence 
we can calculate Q(m, n, p’) for any even n. Now it can be shown‘ 
(see Appendix C) that* 


L(6, 2, 0°) = 3 erfe (0), = 7/2; (34) 
L(6, 2, 0°) = erfe [o/(2)*] — 1 erfe? [o/(2)*], = 7/4 


and 


3 erfe (o sin 6) + max (0. 3 erfe (c sin 6) 


a ms exp (—o’)[1 — zo exp (o°) erfc oi} 


IIA 


L(6,2,0°) < erfe (csin 6), 0< 68 7/2; (35) 


* Some of these results can be obtained from Ref. 4 by putting Q = 0. 
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Fig. 7—Lower bound Q(m, n, p?) for n = 11. 


where 
> B- 
max {a, b} = ‘ a 6; (86) 
b, a<b. 


Since the upper and lower bounds to L(6, 2, o”) cannot differ by more 
than a factor of two and since all quantities involved in equations 
(17) through (25) are positive, we shall now write a modified bound 


Q’(m, n, p°) = L' (Onn, 2; 0°) (37) 
where 
L'(6,n, 0”) = L'(0,n — 2, 0°) + cos 6G(6,n — 2,07), n> 83, (38) 
and 

L'(6, 2, «”) = 4 erfe (o sin 6). (39) 
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Fig. 8—Lower bound Q(m, n, p?) for n = 13. 


Since 
L(6, 2, 0°) = L'(0,2,0°), O<68 7/2 
note that 
P(m, n, p°) 2 Q’(m, n, p’). 


Let us now consider the particular case n = 2. For n = 2, 


or 





(40) 


(41) 


(42) 


(48) 
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Fig. 9—Lower bound Q(m, n, p?) forn = 17. 


Since 6,2 = m/m, and p2(d, a) is given by equation (33), it can be 
shown’’’’*'”° from equations (7) and (10) that Q(m, 2, p’) is equal to 
the error probability obtained in m-ary coherent phase-shift keyed 
(CPSK) systems. Also, for n = 2, it can be shown” that 


P(m, 2, p’) = Q(m, 2, p’). (44) 


It, therefore, follows that the error rates obtained in m-ary coherent 
phase-shift keyed systems are identical to those obtained in any m-ary 
digital modulation system that has a bandwidth expansion of two.* 
Hence we conclude that the error rates of any digital modulation system 
with a bandwidth expansion of two cannot be lower than the error rates 
of CPSK systems for all m. 

Since the error rates of CPSK systems have been investigated in 


- CPSK systems can be shown ”:?! to have approximately a bandwidth expansion 
of two. 
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detail,’*"**’”? we shall not give numerical values of Q(m, 2, p’) in this 


paper. However, we would like to note that’®'’'”° 
Q2, 2, p) 3 erfe (p), (45) 
Q(4, 2, p°) = erfe [o/(2)*] — 4 erfe? [o/(2)'], (46) 


and 


4 erfe (p sin r/m) -+ max {0 4 erfe (p sin r/m) 


= fneii exp (—p’)[1 — zp exp (p°) erfe coh 
< Q(m, 2, p’) < erfe(psinz/m), m> 2. (47) 


For signal-to-noise ratios greater than 5 dB, it can be shown* that 
Q(m, 2, p’) & erfe (p sin x/m), m= 4, (48) 


and that the error in this approximation is less than 5 percent. 

For n > 2, Q(m, n, p”) can be evaluated by using methods presented 
in Appendix B and using equations (17) through (25). However, this 
is usually difficult and tedious, and since Q(m, n, p’) and Q’(m, n, p’) 
can at most differ by a factor of two, we shall use the modified bound 
Q’(m, n, p). Observe that Q’(m, n, p’) can easily be evaluated from 
equations (18) through (24), and (37) through (89).* 

For n = 4, 8, 12 and 16, and m = 2‘,1 S$ ¢ S 5, we have evaluated 
Q’(m, n, p') and the results are shown in Figs. 10, 11, 12 and 13. 


V. DISCUSSION AND CONCLUSIONS 


Based on the results of Shannon and Slepian, we have derived, for 
different probabilities of error, lower bounds to the channel signal-to- 
noise ratio required by optimal systems to transmit the output of an 
m-ary message source through a channel of bandwidth expansion n. 
We assume that the channel is perturbed by additive white gaussian 
noise, all channel signals have the same average power S, and that 
the processing interval for decoding one message symbol is not greater 
than one signaling interval. When this interval can be arbitrary and 
when the transmission rate is not greater than the channel capacity, 
it is well known that the probability of error can be made arbitrarily 


* For any n, note that 02... = 2/2, and that Q(2, n, p2) = Q’(2, n, p?) = Lerfe 
[o(n/2)*/?]. 
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Fig. 10—Lower bound Q’(m, n, p*) forn = 4. 


close to zero by using long and complex encoding and decoding 
procedures. 

For different practical modulation systems, we can then compare the 
signal-to-noise ratio required for different probabilities of error with 
the lower bound given in this paper for optimal systems. This will aid 
us in deciding about the optimality or nonoptimality of different sys- 
tems, and in evaluating the quality of performance of different modula- 
tion systems. 

By using Slepian’s method, we evaluate this lower bound for odd n, 
and m = 2‘,1 S$ ¢ S$ 5. We also give a method of evaluating the lower 
bound for even n, and derive a simpler modified lower bound for n even, 
and n > 2. This modified lower bound has been evaluated for n even, 
andm = 24,1 <¢<5. 

For a bandwidth expansion of two, the performance of a coherent 


DIGITAL MODULATION SYSTEMS 1051 


@ 

















10°9 








10710 








* 2 
Il 


eee TO- ee ee IN DECIBELS 


Fig. 11—Lower bound Q’(m, n, p?) for n = 8. 


phase-shift keyed system has been shown to be as good as that of the 
optimal system. 

A particular FSK system with a bandwidth expansion of three has 
been compared to the optimal system, and it appears that its perform- 
ance is substantially inferior to that of the optimal system. 


APPENDIX A 


Evaluation of Lower Bound Q(m, n, p’) 


In this appendix, we shall give a second method to evaluate Q(m, n, p’). 
It can easily be shown from equation (9) that 


6..= > forall n, (49) 


= 
2 
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Fig. 12—Lower bound Q’(m, n, p*) for n = 12. 
and that 
P(2, n, p’) = Q(2, n, p°) = 4 erfe [p(n/2)*]. 

Hence, we can write 

L( n, :*) = d erfe (0). 

Since 
1(0,n, 0°) = [ p,(d) a, 
v@ 


or 


nx/2 T 
1(6,n, e a = is p,(d) dd + ie ptr) dr, 


3 
It 
iw 





(50) 


(51) 


(52) 


(58) 
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16, n, Py ? = Pa 2) + 2 erfe [p(n/2)'], 


1053 
(54) 
r”* exp (—1r/2) 


-exp (ro(2)? cos A) sin™”” A dd. (55) 


Expanding exp (ro(2)? cos d) into a Taylor series and integrating term 
by term, we have 


(n — 1) exp (—o°) SG [o(2)*]’ [? snes ; 
aE EEE | a exp — r'/2a, 
2a) (ett 1) a / 


aw/2 
| cos’ \ sin” A dn, 
(7) 





_ (w= 1) exp (~e’) >, Ga r(/ +2) 


2(n)} (ett a 


i ti n—t) (A124). 
3 a(t er ae 





(56) 
Equation (56) can be simplified to 











10, n, o?) = S22) $ a) (t+ 








#t) I (f +1 n- t). 
Q2(r 3 rer 9 cos?6 9 ’ 2 
(57) 
Equations (54) and (57) yield 
1 
2) _ 1 2 Fee fe, ao 2 
L(6,n, 0°) = $ erfe (oc) + Nay exp (—oa’) 
_ (20)! (244) (t+ n—t). 
d £l r 9 Leagag 2 ’ 2 (58) 
For m not too large and for large n, it can be shown that 
Tv 
~ /2 


is small. If 6 is small, we can prove that the series given in equation 
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Fig. 13—Lower bound Q’(m, n, p?) for n = 16. 


(58) converges rapidly and that we may alternatively calculate 
Q(m, n, p’) from equations (7), (9) and (58). 


APPENDIX B 


Evaluation of Distribution Function L(6, 2, o”) 
Equations (10) and (14) can be shown to yield 


8 
1(0,2,0°) =1— | plu) du, (60) 
-6 
where 
p(r) = a [exp (—o”) + a(n)? cos \ exp (—o’ sin’ X) 


-{1 + erf (¢ cos d)}]. (61) 
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Note’*~” that p(d) is the probability density function of the phase 
angle \,0 S \ < 2z, of the sum of a sinusoidal carrier (of unit amplitude) 
and gaussian noise [of average power 1/(20°)]. 
We can also show” that 
7] -. Qh... 


eet De Se 62 
L(6, 2,0) =1 is 2k sin ké, (62) 


where 


2) 


hecer = FED exp (— 0? MLM? /2) + Les /D] 





£=0,1, 2, °° (63) 


hoe = 


Roe 


> A ons 1B2e—(one1) ' Lily 2 oy oes (64) 


Antes = Asser = ED exp (~o'/2L/2) + Tenle/D, 


s=0,1,2,-*+; (68) 


and 


na (xo")* 2 
Boyt = Boyyr = (—1) a a es (—o°/2) 


-[E,00°/2) + Tour@"/2)], =p = 0,1, 2, ++. (66) 


I,(2) is the modified Bessel function of the first kind and of order n. 

Since all h,’s can be calculated using either a set of tables or a digital 
computer and since the series given in equation (62) converges, we can 
calculate L(@, 2, o”) for all o and @. 


APPENDIX C 


Evaluation of Upper and Lower Bounds 


From equations (10) and (14), and Appendix B, we observe that 
L(6, 2, o*) is the probability that the phase angle \, 0 < \ < 2z, of a 
sinusoidal carrier of zero initial phase and unit amplitude lies outside 
the range —6 S \ S 6 when it is corrupted by random white gaussian 
noise of average power 1/20”. 

When 6 = 7/2, we can show’’’”° that 


L(6, 2, o”) = 4 erfe (o), 6 = 7/2. (67) 
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When @ = 7/4, we can also show™'” that 
L(0, 2, «”) = erfe [o/(2)*] —2 erfe? [o/(2)?). (68) 


When 0 S @ < 7/2, let the sinusoidal carrier be represented by 
phasor OS in Fig. 14. Let x, and x, represent the in-phase and quadrature 
components of white gaussian noise corrupting the sinusoidal carrier. 
The quantity L(6, 2, o”) is, therefore, given by the probability that the 
terminus of the vector OT lies in areas marked 1, 2 and 3. 

We, therefore, have* 


erfc (o sin @) — a exp (—o’)[1 — ao exp (o’) erfe (0)] 


< L(0, 2, 0°) < erfe (csin 6). (69) 


Also, since L(6, 2, o”) is greater than the probability that the terminus 
of the vector OT lies in areas marked 1 and 2 (or 2 and 3), we can write 


L(6, 2, o°) > 4 erfe (o sin 6). (70) 
Combining equations (69) and (70), we get equation (85). 
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Fig. 14—Derivation of bounds to L(6, 2, o?). 
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Transit-Time Variations in Line-of-Sight 
Tropospheric Propagation Paths 


By D. A. GRAY 
(Manuscript received December 12, 1969) 


We present in this paper transit-time variations in line-of-sight propaga- 
tion paths and systems operating at frequencies up to 80 GHz. We discuss 
variations due to both atmospheric changes (no precipitation) and rain 
and point out some relationships to PCM systems. 


I. INTRODUCTION 


In a recent paper’, J. R. Pierce considered stable synchronization of 
large digital transmission networks, and pointed out that the realization 
of such a synchronized network calls for, among other things, more 
information concerning network transit-time variations. In this paper, 
we seek extreme values for these variations in line-of-sight propagation 
paths in order to provide some of this information. Estimates are given 
for () the maximum variation in transit-time, Atmax , Which one might 
expect over the period of a year, and (27) the maximum time derivative, 
Tmax » Which one might encounter. The estimated values are related to 
digital systems, with most specific examples given for a 500 megabit 
transmission rate. Variations due to changes in the atmosphere (no 
precipitation), and those due to rain are discussed separately. In the 
longer line-of-sight paths achieved in tandem systems, repeaters are 
assumed to be stable, that is, the concern herein is with atmospheric 
variations only. Delays associated with selective fading are not dis- 
cussed, but they are believed not to exceed the given estimates of the 
maximum variations. 

The transit-time 7 is given by the familiar relationship 

1 Pa 
T= n ds (1) 

C Jp, 
where c is the velocity of light, 7 the medium refractive index, ds the 
differential path length, and the limits P, and P, represent the end points 
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of the path. If one assumes An,,,, , the maximum index of refraction 
change which is expected over a given period of time, and if one assumes 
that the changes in n over the entire path S are perfectly correlated and 
equal to Ann.x , then the corresponding maximum transit-time variation, 
Atmax ; 18 given by 


Areas = = (Atma) S. @) 


In a subsequent section, Atm.x Will be computed after estimating Anmax 
A similar set of assumptions concerning the time derivative 7 leads to 
an estimate of 7.x , namely, 


Ninel (3) 


Tmax 


II. MAGNITUDE OF TRANSIT-TIME VARIATIONS IN THE ATMOSPHERE 


Letting the superscript a denote the atmosphere, the estimate of 
An‘, to be used in this section is based on data found in Bean and 
Dutton.” The data comprise eight years of point observations at six 
locations in the United States. The locations are listed in Table I where 
the maximum, minimum, and range An of the index of refraction are 
given in N units, with N = (n — 1) X 10°. A value which approximately 
strikes an average for the six locations listed will be used, namely, 
An’,,, = 1.0 X 10-*. Substituting in equation (2), 


Ar, = 0.333 X 10°°S (4) 


where S is in kilometers. Equation (4) is plotted in Fig. 1. Since the 
atmospheric index of refraction, n*, may be regarded as independent of 
frequency up to 30 GHz’, the curve in Fig. 1 may be considered applic- 
able up to this frequency. 


TABLE J—EXTREMES IN INDEX OF REFRACTION FOR SIX 
LOCATIONS IN THE UNITED STATES 


Location max N min NV AN 
Washington, D. C. 393 277 116 
San Antonio, Texas 387 258 129 
Bismarck, N. D. 368 260 108 
Colorado Springs, Col. 307 214 93 
Salt Lake City, Utah 323 230 93 


Tatoosh Island, Wash. 354 292 62 
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Fig. 1—Estimated maximum atmospheric (no precipitation) transit-time varia- 
tions aie long path length line-of-sight tropospheric communications systems. 


For long paths, Ar%,, is overestimated because the assumption that 
An is perfectly correlated and equal over the entire path becomes un- 
realistic as S increases. However, the overestimation is not excessive 
because those components in An which primarily are due to seasonal 
and diurnal variations are highly correlated and combine to be of the 
order of 50 N units (approximately 30 N units for seasonal and 20 NV 
units for diurnal). Thus, Ar%,,, is conservative to within a factor of two 
for the continental United States. As an example, for a 3000-mile path 
in Fig. 1, A7r2,, = 1.6 us; certainly, it would be no less than 0.8 us. In 
terms of a 500 Mb system, Ar. = 1.6 us is equivalent to an 800 bit 
variation. 

For short path lengths, more detailed data are needed and these are 
in Vig. 2. One notes that, for short paths, a An* which typifies a given 
region should be used instead of the Anz, of 100 N units. Therefore, 
Fig. 2 shows Atmx Versus path length for each of the six locations listed 
in Table I. 

Now consider the problem of synchronizing two clocks separated by 
a distance S. For a digital system of pulse spacing T’, two clocks may be 
considered synchronized if they are in phase to within a factor f of a 
pulse spacing. For the sake of argument in this discussion, we choose 
f = 0.1, and we draw on Fig. 2 the lines f-T for pulse rates of 50, 100, 
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Fig. 2—Maximum atmospheric (no precipitation) transit-time variations for 
short paths. Variations are shown for six locations in the United States. 


200, and 500 Mb. The intersections of the lines f- 7’ with the curves for 
Atmsx Getermine the maximum path lengths for which atmospheric 
transit-time variations may be neglected. For instance, these lengths 
are respectively 5, 2.5, 1.25 and 0.5 km for the 50, 100, 200, and 500 Mb 
rates when the region of concern is Washington, D. C. For some other 
criterion of synchronization, that is for some other f, these path lengths, 
of course, will be different. 

From Fig. 2, it is evident that for any but the shortest path lengths, 
some compensation for refractive index changes must be incorporated 
in a digital system running on a universal clock. Figures 1 and 2, how- 
ever, do not provide information on how rapidly one must compensate. 
This question is answered by first considering the time derivative of 
transit-time variations which are due to turbulence, and then by con- 
sidering those due to the motion of synoptic scale air masses. 


III. TIME DERIVATIVE OF TRANSIT-TIME VARIATIONS 


Table II shows the rms values of the hourly transit-time variations®* 
which are due to atmospheric turbulence. Because of the randomness 


TRANSIT-TIME VARIATIONS 1063 


TasBLE JI—RMS Hovurty Transit-TimMe VARIATIONS 
IN THE TURBULENT ATMOSPHERE 


RMS Delay Path Length Frequency 
(seconds) (miles) (GHz) 

1.5 KX 10-8 2.25 10 
4X 107" ee) 1 

15 X 10712 10 1 

22 X 10-8 60 1 


of turbulent motion, one would expect the rms values to be proportional 
to the square root of the path length S. Thus, on an hourly basis, the 
variations for a transcontinental path work out to be 0.2 ns, which 
is one tenth of the pulse spacing in a 500 Mb system. On the basis of a 
five minute interval, measurements show that rms variations are two 
orders of magnitude smaller than the hourly changes.* Thus it appears 
that compensation for turbulence-induced atmospheric transit-time 
variations can be made on the order of tens of minutes for transcon- 
tinental links, and even more slowly for shorter paths. 

In contrast with the turbulence-related phenomena discussed above, 
the motion of synoptic-scale air masses brings about changes in index- 
of-refraction which are correlated over large regions, that is, for regions 
covering hundreds of kilometers, changes in » will be proportional to 
path length, S, not to (S)*. For instance, an advancing cold front brings 
with it a decrease in temperature 0, a decrease in water vapor pressure 
e, an increase in total pressure P, and an accompanying change AN, 
which is well correlated over the entire frontal advance. 

For the purpose of discussion, a model front is shown in Fig. 3. The 
model front passes transversely across a microwave transmission path. 
The index of refraction change AN occurs over a distance S, , and the 
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Fig. 3—Model of a frontal system moving across a line-of-sight transmission path. 
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front moves at velocity v. This model permits one to estimate the maxi- 
mum rate of change of transit-time Tinax « 
Proceeding with the estimate of 7.x , from Fig. 3, 


= 10 -1 AN r 


max ta 








max 


where t, = S,/v. Substituting in equation (3), 
u AN» 


A 


imax = 0.33 X 107 S. (5) 


max 





Estimates of the quantity AN -/t, | max have been obtained from tem- 
perature, pressure, and relative humidity data continuously recorded 
at Crawford Hill since January 1, 1967. The records were examined for 
events of rapidly changing atmospheric conditions. N was computed 
for conditions just prior to and just after these events using the formula” 


N = 77.6 = +. 3.73 & 10° 3 (6) 


where @ is expressed in kelvins, and P and e in millibars. AN; was taken 
as the difference between the two computed values for each event. Table 
III gives the date, time, AN; , ts , and AN;/t, for the largest events 
recorded. It is noted from the Table that AN 7/t, |max = .0788. Rounding 
off to .08, and substituting in equation (5), 


TaBLE IJJ—InNpEx or REFRACTION CHANGES ACCOMPANYING 
RapPpipLy VARYING ATMOSPHERIC CONDITIONS 





&’ Date Time ANF ta (sec) AN p/ta 
2/28/67 10:30 PM 7.23 900 .00804 
7/14/67 3:00 PM 3.94 600 00657 

10/3/67 9:00 PM —13.67 300 0456 
2/17/68 2:00 PM 13.63 300 0455 
3/29/68 3:30 PM 15.93 900 0177 
4/30/68 6:00 PM 7.55 300 0252 
6/3/68 4:00 PM 13.75 600 0229 
7/2/68 8:00 PM 22.18 900 0246 
7/24/68 2:30 PM 19.84 300 0662 
8/7/68 2:30 AM 18.6 600 0311 
8/15/68 4:00 AM —31.75 1200 0264 
8/17/68 3:30 AM 6.1 300 0206 
8/22/68 9:40 PM —2.42 300 00807 

11/29/68 2:00 AM 3.67 180 0204 

12/5/68 2:00 PM 14.18 180 0788 
6/13/69 2:30 PM 10.12 600 0169 
6/24/69 5:30 PM 7.49 300 0250 
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ima = 2.67 X 1078S, (7) 


The maximum value S over which good correlation of N can occur will 
be taken as 500 km (this value will be discussed subsequently). From 
equation (7), 7mox &¢ 1.36 X 107" sec/sec. In terms of a 500 Mb system, 
Tmax & .0667 b/s, or 1/16th of a bit per second. 

The derived 7mex is thought to be larger than would be encountered 
in practice. Complete parallelism (or coincidence), as shown in Fig. 
3, becomes highly improbable when the path length S approaches 500 
km. Primary reasons are: (7) a front and a transmission path can be 
oriented at angles ranging over a large fraction of + radians; (di) nature 
will not produce fronts composed of straight line segments, but rather 
of curves. Since a line-of-sight propagation route is made up of straight 
line segments, it will be improbable that a front would coincide with it. 
Consequently, it appears that the choice of S = 500 km is extreme, and 
that 7max = 1/16 b/s at 500 Mb is a good upper bound. 


IV. TRANSIT-TIME VARIATIONS DUE TO RAIN 


Letting the superscript r denote rain, then An’ is specified using com- 
putations of the medium refractive index for given rainfall rates under 
the assumption of a Laws & Parsons drop size distribution.” These com- 
putations include the index of refraction of the medium for 6, 16 and 
30 GHz for the rain fall rates 0.25, 1.25. 2.5, 5.0, 12.5, 25.0, 50.0, 100.0, 
and 150.0 mm/hr. For rainfall rates exceeding 150 mm/hr, proportional 
scaling of the 150 mm/hr medium index will be used. 

The nature of rainfall is such that higher fall rates are associated with 
smaller areas of coverage. For this reason, long and short path length 
transit-time variations will be treated differently. For path lengths of 
the order of hundreds of kilometers, we will assume that average-path 
rainrates of about 10 mm/hr can occur. Using a 10 mm/hr rainfall 
rate, An" is equal to 0.83 N units at 6 and 16 GHz, and 0.67 N units at 
30 GHz. The corresponding values of Ar’ are plotted versus path 
length in Fig. 4. Comparison of Fig. 4 with Fig. 1 shcws that the transit- 
time variations one expects from rain are more than an order of magni- 
tude smaller than those expected from the atmosphere itself. This im- 
plies that as far as the dynamic range of compensation equipment in 
synchronized digital systems is concerned, atmospheric variations (other 
than rain) are the determining factor for long paths. 

For short paths, rainfall rates much greater than 10 mm/hr often 
occur. Using results for New Jersey,’ transit-time variations for short 
paths are computed using the refractive index values found in Table 
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Fig. 4—Estimated maximum rain-caused transit-time variations for long path 
length line-of-sight communication systems. 


IV; they are plotted in Fig. 5 with P as a parameter, where P equals 
the number of minutes per year the given variation will be exceeded. 
Variations to be exceeded 0.5 min/year and 5 min/year are shown, and 
as before, the rainfall variations are an order of magnitude smaller than 
the atmospheric variations (Fig. 2) for the corresponding path lengths. 

When viewed in terms of the nature of convective showers (showers 
exhibiting high rainfall rates), Fig. 5 aids in estimating the upper bounds 
for the time derivative of rain-caused transit-time fluctuations. Con- 
vective showers can introduce large attenuation in a path rapidly with 
onsets of the order of tens of seconds. In Fig. 5 the intersection of the 
criterion line for a 500 Mb system with the 16 GHz curve for 0.5 min/yr 
occurs at a path length of 10 km. If the onset of the rain over this path 
were to occur in 10 seconds, then 7,,x = .01 b/s, which is a factor of 6 
smaller than the time derivative for the fronts considered previously. 


IV. CONCLUSIONS 


The maximum transit-time variations encountered in the troposphere 
are due primarily to changes in the gaseous atmosphere rather than 
rain, and may amount to 1.6 us for a transcontinental path; this converts 
to 800 bits in a 500 Mb system. The maximum time derivative of the 
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TasLeE [V—InNpEXx oF REFRACTION VARIATIONS EXCEEDED 
P Minutes per YEAR For Various Para LENGTHS 





(a) P = 5 min/yr. 





Path Average 
Path Length Rainfall Rate | An™ at 6 GHz | An’ at 16 GHz | An’ at 80 GHz 





(km) (mm /hr) (N units) (N units) (N units) 
1.3 140 9.44 7.42 4.64 
2.6 135 9.11 7.18 4.51 
5.2 110 7.46 5.98 3.86 
7.8 90 6.14 5.02 3.3 

10.4 80 5.48 4.54 3.0 


(b) P = 0.5 min/yr. 





Path Average 
Path Length Rainfall Rate | An’ at 6 GHz | An’ at 16 GHz | An’ at 30 GHz 





(km) (mm/hr) (N units) (N units) (N units) 
1.3 190 12.8 10.1 6.21 
2.6 175 11.78 9.3 5.72 
5.2 150 10.1 7.9 4.9 
7.8 130 8.78 6.94 4.38 

10.4 110 7.46 5.98 3.86 

0.4 0.20 


~--- 30 GHZ 
16 GHZ 
—-— 6 GHZ 
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Fig. 5—Rain-caused transit-time variations for short path lengths. The parameter 
P equals the number of minutes/year the given variation will be exceeded. 
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transit-time variation is due to the motion of fronts, and has been derived 
herein: 7max = 1.86 X 107” sec/sec which is equivalent to 1/16th of a 
bit at 500 Mb. Neither Art, nor 7msx appears so large as to prohibit 
the synchronization of digital systems transmitted over line-of-sight 
propagation paths. 


Vv. ACKNOWLEDGMENTS 


The author expresses his appreciation to J. R. Pierce for stimulating 
this study, to D. C. Hogg for discussions and to D. Setzer for computa- 
tions. 


REFERENCES 


1. Pierce, J. R., ‘Synchronizing Digital Networks,” B.S.T.J., 48, No. 3 (March 
1969), p. 615-636. 

2. Bean, B. R., and Dutton, E. J., Radio Meteorology, Monograph 92, Washington, 
D. C.: National Bureau of Statistics, 1966, pp. 78, 409-414. 

3. Beard, C. I., ‘‘Phase Quadrature Components of 10.4 GHz Scattered Field on a 
oa Tropospheric Path,’’ Proc. IEEE, 56, No. 8 (August 1968), pp. 1398- 
1399 


4. Herbstreit, J. W. and Thompson, M. C., ‘‘Measurement of the Phase of Radio 
Waves Received over Transmission Paths with Electrical Lengths Varying as 
a fe ie jucoephens Turbulence,’’ Proc. IRE, 43, No. 10 (October 1955), 
pp. 1391-1401. 

5. Setzer, D., “Computed Transmission Characteristics of Rain at Microwave and 

Visible Frequencies,” to be published in October 1970 B.S.T.J. 

6. Hogg, D. C., “Statistics on Attenuation of Microwave by Intense Rain,” 
B.8.T.J., 48, No. 9 (November 1969), pp. 2949-2962. 


Joint Optimization of Automatic 
Equalization and Carrier Acquisition 
for Digital Communication 


By ROBERT W. CHANG 


(Manuscript received January 22, 1970) 


In this paper, we analyze single-sideband amplitude-modulation digital 
communication systems to develop a method for jointly and optimally 
setting the carrier phase and the automatic transversal equalizer of such 
systems. Mean-square equalization error 1s used as the performance 
criterion. We develop a simple receiver structure and study the convergence 
of the method. Exact locations of the stationary points in the parameter 
space are determined and the classifications of the stationary points are 
obtained. We show that the mean-square equalization error has only global 
minima and saddlepoints, but not local minima and maxima. Thus, the 
mean-square equalization error will converge to the absolute minimum by 
the proposed method, regardless of the initial settings of the parameters. 
A simple condition on the step sizes of the adjustments 1s also obtained 
which ensures the convergence of the process. Explicit formulas of the 
joint optimum parameter settings and of the corresponding minimum 
mean-square error are obtained. For illustration pur poses, a single-sideband 
digital communication system using a five- or nine-tap transversal equalizer 
is simulated on a computer. Both theory and simulation show that the 
equalization error depends critically on the carrier phase when the number 
of equalizer taps ts not large, and that the minimum equalization error 
can be obtained by using the proposed method. 


I. INTRODUCTION 


In single-sideband amplitude-modulation digital communication sys- 
tems with transversal filter equalization,’’? the adjustment of the 
carrier phase is critical to the system’s performance when the number 
of equalizer taps is not large. In this paper, a method is proposed for 
setting the carrier phase jointly with the automatic equalizer to minimize 
the mean-square equalization error. 
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We formulate a mathematical model of this study in Section II; 
in Sections III and IV, we analyze the system and develop a receiver 
structure. The problem of convergence is studied in Sections V and VI 
to determine if the equalization error will converge to the absolute 
minimum by the proposed method and whether such convergence 
depends on the initial settings of the parameters. We also consider 
step sizes of the adjustments. In Section VII, we derive explicit formulas 
for evaluating the system’s performance. A voiceband data communica- 
tion system is simulated on a computer to test the proposed method 
and the results are described in Section VIII. 


II. MATHEMATICAL MODEL 


As shown in Fig. 1, a single-sideband amplitude-modulation system 
is considered. When an impulse 6(¢) is applied to the transmitter input, 
a signal a(t) is received at the receiving filter output. The Fourier 
transform of a(é) is denoted by A(f). (The Fourier transform of a 
function will be consistently denoted by the appropriate capital letter.) 
It is assumed that A(f) is band-limited between f, and f,, that is, 


A(f) #0, only for fi <|f| < fe. (1) 


The signal a(¢) is demodulated as shown in Fig. 1, where the demodulat- 
ing carrier frequency is f, . In single-sideband systems 





TRANSMISSION 
MEDIUM 


|w(F) [eo os? 









s(t) 


Fig. 1—An amplitude modulation system with coherent detection and transversal 
filter equalization. 
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IIA 


de fi (2) 


or 
de tax (3) 


The demodulating carrier phase is denoted by @. The demodulator 
output is a signal y(¢) with Fourier transform I'(f). Since A(f) is band- 
limited, I'(f) is also band-limited, that is, 


rG)=0, |f|> fo (4) 


where fy is fe — f, when f, is f, or fz (fo is larger than f. — f, if f, is less 
than f, or larger than f,). 

As shown in Fig. 1, the channel is equalized by a conventional trans- 
versal equalizer consisting of 2N + I taps with gainse, ,n = —N toN, 
spaced at 7')-second intervals, where 


T) = oA seconds. (5) 
When an impulse 6(¢) is applied at the transmitter input, the trans- 
versal equalizer output is s(t). Clearly s(t) is the overall impulse response 
of the system. The receiver parameters (that is, the demodulating 
carrier phase @ and the equalizer tap gains e, , nm = —N to N) will be 
set to minimize the difference between the overall impulse response 
s(t) and a desired impulse response q(é). The familiar mean-square 
error criterion is used. That is, @ ande,,n = —WN to N, will be jointly 
set to minimize the mean-square error 


&= f(s) — a(0F at. (6) 


For brevity, the tap gains e,, n = —N to N, will be abbreviated 
{é,} in the sequel. 


Ill. ANALYSIS 


In this section, we analyze the system to develop a receiver structure 
for jointly setting @ and {e,} by the method of steepest descent. 

In analyzing carrier signals, it is most convenient to use Hilbert 
transform techniques. As is well known,’ the demodulator output y(é) 
is related to the input a(t) by 


y(t) = cos (2rf.t + @)a(t) + sin (2rf,t + 6)4(2) (7) 
where d(¢) is the Hilbert transform of a(t). {When dealing with lengthy 
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time functions, we shall sometimes use the sign 3C, that is, 3[a(d)] = 
a(t).} 
It is seen from Fig. 1 that 


N 


s(t) = D> en(t — nT, — NT,). (8) 


n=—-N 


We shall use the partial derivatives 0&,/00 and 0&,/de, in the method 
of steepest descent. Since q(t) is independent of 6, we obtain from 
equation (6) 


Sf =f tec) — (0) SO a. ( 


In writing equation (9), the order of differentiation and integration 
has been exchanged [the underlying conditions for making such an 
exchange are easily satisfied by s(t) and q(t) encountered in communica- 
tion systems]. Substituting equation (7) into equation (8) and taking 
the partial derivative give 


N 
ol = >> e,{—sin [2rf.(¢ — nT, — NT.) + Ola(t — nT. — NT) 


+ cos [2rf,(¢ — nT) — NTo) + O]G(¢ — nT) — NT,)}. (10) 


From equations (8) and (7), 


&) = > e,[3C{ cos [2rf.(é — nI') — NT,) + Olaté — nT. — NT,)} 


+ se{sin [2rf.(¢ — nT, — NT.) + 6]4(t — nT) — NT>)}]. (11) 


In single-sideband modulation, we have either inequality (2) or 
inequality (3). Let us consider inequality (2) first. When inequality 
(2) holds, the frequency spectrum of a(t), A(f), does not overlap the 
spectra of cos 2af,t and sin 2zf,t. Furthermore, A(f) occupies a higher 
frequency band; therefore, equation (11) becomes 

KC{cos (Qrf,t + é)a(t)] = cos (2rf,t + Aad), 
and 
5C[sin (Qrf,é + @)d(4)] = —sin (2zf.¢ + d)a(d), ee 


Substituting the above into equation (11) gives 
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N 


a(t) = >> e,{cos [2rf(t — nT, — NT.) + 6la(t — nT, — NT) 


n=-N 


— sin [2rf,(t — nT) — NT.) + Ola(t — nT, — NT))}, 


feShi. (12) 
Comparing equation (12) with equation (10) shows that 
s() = a“ hia (13) 
Substituting equation (13) into equation (9) gives 
8&5 Pa , 
oe = | 280 - OKO a, Sh. (14) 


Since a function and its Hilbert transform are orthogonal, equation 
(14) reduces to 


oes Pe) a qs) dt, of. Sh. (15) 


Thus, 09&)/00 can be generated by correlating g(t) with §(¢). Note 
that the transversal equalizer output will be §(¢) instead of s(t) if 
the demodulating carrier cos (27f,t + 6) is replaced by cos [2rf,t + 
6 + 2/2). However, even though §(t) can be generated in this simple 
fashion, we prefer not to generate s(t) because the system must be 
used instead to generate s(t) to compute the other partial derivatives 
0&,/de, . Therefore, we convert equation (15) into the form 


2 





=2f aoa, fSh. (16) 


This step can be verified by Parseval’s theorem. Now we need only to 
correlate s(¢) with an easily generated 4(t) to obtain 08/00. 

The above is for the case f, S f,. In the other case, f, 2 fe, the 
frequency spectrum of a(t) occupies a frequency band lower than f, ; 
therefore, the two equations above equation (12) should be rewritten as 


3C[cos (2rf,¢t + d)a(t)] = sin (27f,t + d)a(d), 
sC[sin (2rf,t + 0)a(4)] = —cos (2rf,t + Aad), | Pare ee 
Repeating the steps from equation (12) to equation (16), we get 


a= of edt, Bh. an 
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Note from equations (17) and (16) that the sign of the correlator output 
must be reversed when one shifts the carrier frequency from one side 
of A(f) to the other side. 

About the equalizer tap gains, it is seen from equations (6) and (8) 
that 


8 = [ape — ao) 2 at, 





= / Oh OHS SN n= SN to 
(18) 


Thus, 08,/de, can be generated by correlating the error signal [s(¢) — 
q(t)] with the output y(é — nZ — NT) of the nth tap. This is the 
concept introduced in Ref. 3 where the problem of setting the tap 
gains {e,} was considered. 


IV. RECEIVER STRUCTURE 


It has been shown in the previous section that 08/06 can be easily 
generated simultaneously with 0&,/de,, n = —N to N. Therefore, 
the method of steepest descent can be used to adjust simultaneously 
6 and j{e,}. In the training period prior to data transmission, isolated 
test pulses are transmitted. For instance, 6(#) in Fig. 1 may be one of 
the test pulses. The transmission of 6(#) generates a signal s(¢) at the 
equalizer output. From s(é) the partial derivatives 0&)/06 and 0&)/0de, , 
n = —N to N, are computed. The parameters @ and {e,} are then 
changed by amounts proportional to the partial derivatives, that is, 





_ a& 
Ord i— ota Qa 06 ’ 
Ca ae = (€n)ota yet B oe ’ n= —N to N, 


where a and 8 are positive proportional constants which may vary 
from one adjustment to another (the choice of their values will be 
considered in Section VI). After the changes are all made, another 
test pulse is transmitted and the process is repeated. The process is 
terminated after a prefixed number of test pulses. 

The receiver structure is shown in Fig. 2. The partial derivatives 
0&,/00 and 0&)/de, are computed according to equations (17) and (18), 
respectively (f, 2 f. is assumed). If f, S f,, the correlator output in 
Fig. 2 will be —0&,/08. 
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Y (t-N TNT) 





PHASE SHIFTER 


Fig. 2—Block diagram of the joint method. (C in the block denotes correlator.) 


V. FURTHER ANALYSIS 


We now analyze the system to answer the questions: (2) Will the 
mean-square error converge to the absolute minimum by the proposed 
adjustments? (77) Is the convergence ensured regardless of how the 
parameters @ and {e,} are set prior to the starting of the process? (772) 
What is the minimum mean-square error that can be obtained by the 
proposed method? To answer these questions, it is necessary to locate 
the stationary points of &, to distinguish between various types of 
stationary points, to determine conditions under which &) will converge 
to a global minimum, to derive explicit solutions of & when the param- 
eters are set jointly or independently, and to obtain numerical data 
by simulating a real typical channel. In this section, we determine the 
location of the stationary points and distinguish between various types 
of stationary points. 

It is noted from equations (16) and (17) that 0&,/d0 changes sign 
when the carrier frequency f, is shifted from one side of A(f) (that is, 
f. S fi) to the other side (f, 2 f.). To avoid this complication, a quantity 
p defined by 


6, when f, Sf,, 


—6, when f, 2 f., 


> 
l 


(19) 
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will be used instead of 6 in the following. By this change, the results 
obtained in the sequel will hold for both f, S f; and f, 2 f.. Hence, 
the position of f, will not be identified further. 

Since it is more convenient to use matrix notations in the following, 
time samples will be used. Let 


&, = s(kT + NT), 


Qe = Uhl, + NT), (20) 

y= y(—kT,). 
Then equation (6) becomes 

J 
& = =~ & 
0 fo 

where 

&= di &— aw) (21) 


Since fp is a constant, minimizing § minimizes &. 
It can be easily shown that equation (21) can be written in the 
following matrix form 


& = E’yE — 2E’v + p> gi (22) 


where the prime denotes transpose, the vector E and v and the matrix 
y are defined by 


r 


én Vv_N 
| 
E = | €-n+1 vos V-N+1 (28) 
én | Un 
 apisad sain Y-n.N 
y=| : - |, (24) 
Yn,-n °° **)— os UN 
Uni = > YVa-nVi-k (25) 
k= 00 


Vn = 2 VV n-k ° (26) 


= —00 
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Clearly, E and v are (2N + 1) X 1 vectors, and y is a (2N + 1) X 
(2N + 1) matrix. One can easily show that y is nonsingular. 
We now evaluate the partial derivatives 0&/dp and 0&/de, ,n = —N 
to NV, to determine the optimum parameter settings. From equation (22), 
96 _ Opp _ 9 Ip 
= Pale a ae (27) 
To evaluate the terms in equation (27), we note from sampling theorem 
that equation (25) can be written as 


eee [ : v(by{t — (h ~ 4)T,] at. (28) 


By Parseval’s theorem, equation (28) becomes 


the =o f 1TOP exp (afk — )Tolaf. (29) 


For single-sideband systems, the demodulating carrier phase p appears 
only in the phase characteristic of I'(f), while the amplitude char- 
acteristic | I'(f) | of I'(f) is independent of p. Therefore, from equation 
(29), yn. is independent of p and 


9 pep 
3p EVE = 0. (30) 


This greatly simplifies the results. (It is important to note that this 
simplification is not possible for double-sideband and vestigial-sideband 
systems because p appears in | I'(f) | in such systems.) Substituting 
equation (30) into equation (27) gives 

ag _ 

eae (31) 


To evaluate equation (31), we convert the elements »v, of v into explicit 
functions of p. For single-sideband systems, ['(f) can be decomposed 
into the form 


Pf) 


H(f) exp (—jp), f 20; 
H(f) exp (jp), fS0; 
where H(f) is independent of p.* From equations (26), (20), (4), and 


(32) 


* H(f) is the Fourier transform of y(t) when p = 0. The amplitude and phase of 
H(f) a be denoted by |H(f)| and n(f), respectively, that is, H(f) = |H(f)| exp 
[jn(f]- 
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(82), it can be shown that 


Un = Mn EXP (—jp) +», exp (jp), n= —NtoN (33) 
where 
tn = > ae [HO exp Ui2mf(e — n)Pol af (34) 
and 
n= DO af HO exp tiene Ia. 8) 


Note that yu, and y, are independent of p. From equation (33), we can 
evaluate the term 0/dpE’v in equation (31) to obtain 


0& : : : 
a 2j[exp (—jp)E’u — exp (jp)E’y] (36) 
where 
Hn Vin 
w= cee yes a : (37) 
Ly YN 


The above is for 08/dp. The other partial derivatives can be obtained 
from equation (22) as 


0g 
aE = 2yE — 2v. (38) 


A necessary condition for a specific p and E to be jointly optimum 
(that is, to jointly minimize 8) is that they satisfy 


ag 


3p = 0 (39) 
and 

0g 

3E 0. (40) 


There are special cases where the optimum setting of p is arbitrary 
(for instance, if an infinite length tapped delay line is used, the taps 
can always be adjusted to reduce the mean-square error & to zero 
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regardless of how p is set). We shall not consider such special cases here. 
This implies, as can be shown from equations (86), (88), (89), and 
(40), that the special cases where u’y “uy is zero are disregarded. 

From equation (38), equation (40) can be written as 





E = y'v. (41) 
From equations (36) and (41), equation (39) can be written as 
exp (—jp)u’y"'v — exp (jp)v’y 'v = 0. (42) 
One can show from equation (83) that equation (42) is equivalent to 
vy 
te oe (j4p). (43) 


It can be shown from equations (84) and (35) that 
Re [vy ‘v] = Re [v’y"'v], 


and 
Im [y/y‘u] = —Im [v’yv], (44) 


where Re and Im denote real and imaginary parts of the complex 
number, respectively. From equation (44), one can show that equation 
(48) is satisfied if and only if 


as -: Im [y’y" ul] 
p= M = 9 E44 tan Re (y'y ial (45) 


where my can be any integer including zero. Substituting equation (45) 
into equation (33) and substituting the resulting equation into equation 


(41) give 
= fran Ta aoe A eye) -1 
E = exp | ifm, 9 + 5 tan Relay ul you 


-1 Im [u’y ul ~1 

+ exp | (m, 2 9 + #tan flea) ly v. (46) 

The value of p and E which satisfies the necessary conditions (89) 

and (40) is given by equations (45) and (46), respectively. It is clearly 

seen from equations (45) and (46) that, since m,) can be any integer, 

there is more than one set of solutions of p and E from conditions (89) 

and (40). In the following, we determine which of these solutions 
actually minimizes & . 

In conventional terminology,* each solution of conditions (89) and 
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(40) is a stationary point of & . We shall determine which of the sta- 
tionary points is a global minimum. 

We first identify the minima, maxima, and saddlepoints. As is well 
known,’ a sufficient condition for a stationary point to be a local mini- 
mum is that the quadratic form 


F = h’Qh (47) 


be positive-definite at that stationary point, where Q, known as the 
Hessian matrix, is 








a°& _8& ~.. A & 
Op” Op 0€_n Op Oey 
a” &> VE —  _0'& 








Q=|deydp dy dy Oey 


OE _8& ~~. a & 
Oey Op Oey O€_y Oex 


and 
h’ = [hy he paver hen+2)]. 


A stationary point is a local maximum if F is negative-definite at that 
point. At a saddlepoint, F is indefinite. To evaluate F, note from 
equations (16), (17), and (19) that 





ie = 2 / ; g(i)s(f) at (48) 
and 
dst) _ 
a $d). (49) 
From equations (48), (49), (8), and (18), we get 
ra 22 [ : g(i)s(1) dt; (50) 


88> f nuit ae , ras . 
Spa 2 fe mle — NT 4) dt, m= —N to N; (51) 


and 





2 (2) 
ED has i v(t — iT) — NTo)y(¢ — iT. — NT.) dt, 
1,j=—-N to N. (52) 
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Substituting equations (50), (51), and (52) into equation (47) and 
rearranging, we obtain after some steps 


F = 2 [ | iad -+- > Neanen y(t = nT'> =, wr | dt 
-0 n=—-N 
+ ant | g(s( at 


— on? / ” (i) dt. (53) 


Now we may substitute equations (45) and (46) into equation (53) 
and evaluate the resulting expression over all possible h, to hey+2) 
to determine whether F is positive-definite, negative-definite, or in- 
definite at a given stationary point. This determines if that point is 
a local minimum, a local maximum, or a saddlepoint. While these steps 
are important in the analysis, they are rather complex and are therefore 
given in Appendix A. It is also shown in that appendix that all the local 
minima are equal and hence all are global minima. The results in 
Appendix A are summarized in the following proposition. 


Proposition 1. & has a global minimum when p and E are given, re- 
spectively, by equations (45) and (46), with m, in these equation being 
an even integer. & has a saddlepoint when p and E are given, respec- 
tively, by equations (45) and (46), with m, in these equations being 
an odd integer. 

It is seen from this proposition that there is an infinite number of 
global minima, each one corresponding to an even integer m,. The 
distance in p between two adjacent global minima is therefore z. There 
is also an infinite number of saddlepoints, each one corresponding to 
an odd integer m, . The distance in p between two adjacent saddlepoints 
is also 7. The distance in p between a saddlepoint and its adjacent 
global minimum is 7/2. It is instructive to illustrate these with an 
example and a figure. Consider a transversal equalizer with only one 
tap é (such a single tap serves as an automatic gain control and 
the problem is to jointly set the automatic gain control and carrier 
phase to minimize the mean-square error). For simplicity, suppose 
that the term tan™* [Im (y’y‘u)]/[Re (w’y‘w)] in equation (45) turned 
out to be zero. Then from the proposition § has global minima at 
p = 0, +a +27, --- , and & has saddlepoints at p = +7/2, +37/2,---. 
The global minima at p = 0 and 7 are illustrated by points 1 and 3 
in Fig. 3 and the saddlepoint at p = 7/2 is illustrated by point 2. 
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Fig. 3—An example illustrating the global minima and saddlepoints of &o. 


The curved surface in the figure illustrates the variation of &) with p 
and é) . For instance, when p is fixed at 7, &) varies with é) as shown by 
the convex curve passing through points 4, 3, and 5. (For fixed p, & 
is a convex function of the tap gains e_y to ey .) As can be seen, point 
2 is a saddlepoint because varying e, away from point 2 with p constant 
increases § , while varying p away from point 2 with eé) constant 
decreases &. 

To summarize, in this section we have located the global minima 
and saddlepoints and proved that there is no local minimum or maxi- 
mum. There is also no valley’ since the global minima are all distinct. 
These results will be used in the next section for the study of con- 
vergence and in Sections VII and VIII for the computation and com- 
parison of performances. 


VI. A CONDITION OF CONVERGENCE 


As described in Section IV, after a test pulse is transmitted and the 
partial derivatives are computed, the carrier phase and the tap gains 
are adjusted according to the equation 
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asad, ant OBS 
Ones 7 Bora a 06 ’ (54) 





a8) 


C.)aeae a (nora a7 B de, ’ 


(55) 
where a and 8 are positive proportional constants which may vary 
from one adjustment to another. As is well known,’ the process always 
converges when a and £ are sufficiently small, but may not converge if 
a and 8 are too large. In this section, we derive a condition on a and B 
which ensures the convergence of the process. 

To facilitate analysis, we shall, as in Section V, replace @ with p, 
& with &, the tap gains e_y to ey with the vector E, and the partial 
derivatives 0&/de_y to 0&/dey with the vector 0&/dE. The adjustment 
made after the kth test pulse will be called the kth adjustment (k = 
1, 2, ---). The values of p, E, &, 0&/dp, and 08/0E prior to the kth 
adjustment will be denoted by p, , Ex, &(o,, Ex), (0/0p)&(p;, , E,) and 
(0/0E)&(p, , E,,), respectively. The a and @ used in the kth adjustment 
will be called a, and 8, , respectively (note that a, > O and 6, > 0 
for all k so that the adjustments will be made in the negative gradient 
direction). The values of p, E, &, 0&/dp, and 0&/0E after the kth adjust- 
ment will be denoted by Pk+1> Ext ’ E(px+1 ? Ey.+1); (0/dp) E(pe+1 ’ E41), 
and (0/0E) &(px+: , Ex+1), respectively. 

With the above notations, equations (53) and (54) can be written as 


0 
Pk+i = Pr — Oy ap (px ’ E,), (56) 


0 
Est = E, =P, JE &(px , E,). (57) 


The decrease in mean-square error due to the kth adjustment is denoted 
by A&, , that is, 


A&, = (px ) E,) = S(pe+1 ’ Ex+t)- (58) 


Clearly A&, approaches zero when the partial derivatives approach 
zero. A stronger statement that may sometimes hold is that ‘A6&, 
approaches zero only when the partial derivatives approach zero.” 
By this statement is meant that for every « > 0, we can find a 6 > 0 
such that 


| A& | = 6 (59) 
if 
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[2 em .B)] +(Sem.2)| |S em.eo)e« 60 


We shall say that & converges to a stationary point if for every « > 0 
there is an N such that 


E &(on , B) | + E (6, z)||3 im B)|<e 


forallk > N. 
The following lemma is needed in our discussion. 


Lemma. If A& > 0 for all k and A&, approaches zero only when the 
partial derivatives approach zero, & must converge to a stationary point. 


Proof: The proof of this lemma is simple. Assume that & does not 
converge to a stationary point, that is, for an e > 0 it is not possible 
to find an N such that equation (61) holds for all k > N. Then equation 
(59) must hold for an infinite number of k’s. From this and the assump- 
tions in the lemma, we see that A&, 2 6 > 0 for infinite number of k’s. 
This implies that & reduces without bound as k increases, contradicting 
the fact that the mean-square error cannot be less than zero. Hence 
the lemma holds. 

The lemma provides a means of determining «, and 6, . According 
to the lemma, & will converge to a stationary point if we can determine 
a, and $, such that A&, > 0 for all k and A&, approaches zero only 
when the partial derivatives approach zero. Theoretically, such oa, 
and 6, can be determined using known mathematical techniques.’ 
But unfortunately these techniques were developed for use with com- 
puters and are too complicated to be used in a receiver (for a discussion, 
see Appendix B). In the following proposition, it is shown that a, 
and §, can be determined rather easily if E and p are set in some alternate 
fashion. 


Proposition 2. Let the set of positive integers be divided arbitrarily 
into two disjoint subsets K, and K,, each containing an infinite num- 
ber of positive integers. Let a, = 0 when ke X,, and 8, = O when 
k « K,. Let d, denote the maximum eigenvalue of y (A, > O since y 
is positive definite), and let [f%,, s°(£) df]y, denote the value of {@,, s’(#) dé 
when E = E,. The mean-square error & will converge to a stationary 
point if 


0<A<= (62) 
1 
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for ke &,, and 


1 


22 
mal [ #@ at|, + 2% [gat 
forke K.. 


The proof of this proposition is complex and is given in Appendix C. 
The proposition states that p and E may be adjusted in any alternating 
fashion' and the mean-square error & will converge to a stationary 
point if 6, satisfies equation (62) during the adjustment of E, and 
a, satisfies equation (63) during the adjustment of p. The term ), in 
equation (62) and the terms [f%,, s(t) di]p, and {%,, q’(é) dt in equation 
(63) can be estimated or measured. Consider first \,, the maximum 
eigenvalue of y. There are various methods of estimating the maximum 
eigenvalue of a matrix.’ For example, it is possible to estimate ), 
from amplitude characteristics of the transmission medium. [Note 
from Fig. 1 that the amplitude and phase characteristics of the trans- 
mission medium are denoted by | W(f) | and (f), respectively.] It is 
seen from equation (29) that the elements y,,; of y can be computed 
from | I(f) |. For single-sideband systems, | T'(f) | depends on | W(f) |, 
but not on w(f) and the demodulating carrier phase p. Thus, if statistics 
of |W(f)| are available (for example, in a voiceband system), yp,; 
can be computed from equation (29) and d, can be estimated. The 
maximum possible value of \, then can be used instead of ), in equation 
(62). 

The factor {®., q’(é)dt in equation (63) is known because the desired 
signal q(t) is given. The other factor [f@,, s°(t)dt]z, is simply the energy 
of the signal s(¢) prior to the kth adjustment, and can be easily measured 
at the equalizer output. 

Summarizing the above, a condition of convergence has been described 
in the lemma. Based on the lemma, a specific condition of convergence 
has been obtained in Proposition 2. The upper bound in equation (62) 
can be estimated from a priori channel statistics and the upper bound 
in equation (63) can be easily determined prior to each adjustment. 
Thus, a, and 6, can be set accordingly prior to the adjustments to 
ensure that & will converge to a stationary point. 

It has been shown in the previous section that a stationary point 
must be a global minimum or a saddlepoint. Thus, § may converge 





0O<a< (63) 


t For example, one may fix p and adjust E until 0&/dE approaches zero, then fix E 
and adjust p until 0&/dp approaches zero, and repeat the cycle until both 0&/dE and 
0&/dp approach zero. 
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to a saddlepoint instead of a global minimum. Fortunately, such a 
possibility is remote. A great advantage of gradient methods is that 
they will inherently stay away from saddlepoints.” It has been found 
by researchers that gradient search computer program avoids saddle- 
points so dependably that the only way they could test their program 
for exploring the neighborhood of a pass was to start the search there. 
Wilde and Beightler suggested the reason by sketching a bimodal 
surface to show that only one gradient line out of the infinite number 
possible actually passes through the saddlepoint.” The other gradient 
lines all lead directly to a minimum or a maximum. Hence the possibility 
of converging to a saddlepoint is remote. 

It has been shown in the previous section that the distance in p 
between a saddlepoint and its adjacent global minima is 7/2. From 
this, several tests can be devised for distinguishing between global 
minima and saddlepoints. The following one is particularly simple. 
At the design stage of the system, one may compute the value of & 
at global minima and saddlepoints (the detailed steps in Section VII 
may be used. The value of & at global minima is equal to &,,;, in equation 
(64), while the value of § at saddlepoints is equal to &;,4 evaluated at 
Ap = 7/2.) As illustrated in Section 8.1 and Fig. 4, the value of & at 
saddlepoints can be many times larger than that at global minima. 
Consequently, a threshold can be set up such that when §& converges 
to a value above the threshold it may be concluded that a saddlepoint 
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Fig. 4—Variation of Sina with Ap. (See example in Section VIIT.) 
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is reached. Then p will be shifted by 7/2 and E readjusted. If & con- 
verges to a value below the threshold, the process 1s terminated. This 
and other possible tests all require additional circuitry. Since the 
possibility of converging to a saddlepoint is remote, one should decide 
from actual trials whether such a test should be employed. 


VII. PERFORMANCE COMPARISON 


In the previous sections, we have considered jointly setting p and E 
for minimizing the mean-square error &. It has been shown in Prop- 
osition 1 that the joint optimum settings of p and E are given, re- 
spectively, by equation (45) and (46) (with mp) in these equations being 
an even integer). Substituting these optimum settings into equation 
(22), we obtain the minimum mean-square error 


ie) 


Sain = 2 q: ~ 2\u’'y "ul — Quy”. (64) 
It has been shown that this &,;, can be obtained by setting p and E 
jointly. Now we wish to compare 6n;, With what can be obtained by 
another scheme. In single-sideband systems, it is possible to transmit 
a carrier pilot to the receiver to demodulate the received signal. The 
demodulating carrier phase therefore is 


Pp = pe + pr, 


where p, is the phase of the received carrier pilot and p; is a fixed phase 
shift that is sometimes introduced for signal shaping. With p fixed at 
pe + ps, E can be adjusted to minimize &. The value of & thus obtained 
will be denoted by &i,4, where the subscript ‘“‘ind’’ indicates that p 
and E are set independently. We now compare E,,4 with Eni, . 

To determine E;,, , let the difference between p, + p; and the optimum 
setting of p be denoted by Ap. From equation (45), 


-1 Im [y’y™ 
dp = pet or — [mo + 3 tan lee | (65) 


where ™m, can be any even integer (including zero). Since E is set to 
minimize & after p is set to p, + p,;, E is equal to y ‘v evaluated at 
p = p. + py, that is, 


ES yy (66) 


where v, is v evaluated at p = p, + p,. Substituting equation (66) 
into equation (22) gives 
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Sing = 2s Gi = viy”'v. . (67) 


=-0 


It can easily be shown from equation (33) that 
vy 've = exp [—j2(o. + p,)lu’y'y + 2y’y'v 
+ exp [j2(p. + ps)]v’y”'v. (68) 


Substituting equation (65) into equation (68), and substituting the 
resulting equation into equation (67), one can obtain after some manip- 
ulations that 


Sing = Dy Ge — 2(cos 2 Ap) | wy un | — 2y’y”’v. (69) 


k=—-00 
T'rom equations (64) and (69), the difference between 6ing and Sin IS 
Sind a Onan = 2[1 — COs 2Ap| |u’y yu ( (70) 


Note that uw’ and y~’ are independent of Ap. Thus, the only term in 
equation (70) that depends on Ap is cos 2Ap. Now we can make the 
following observations. 

First, ina — min 1S nonnegative, meaning that independently 
setting p and E increases the mean-square error. Second, Sina — min 
varies periodically with Ap with a period of z. Third, because of the 
nature of cosine, ina — Smin 18 small when Ap is small, but increases 
rapidly when Ap increases (note the factor two in cos 2Ap). 

Tor a given system, one may compute Sing, Smin, and their dif- 
ference from equations (64) and (69). The computation can best be 
carried out by a computer program in the following steps (see also the 
example in the next section). First, specify fy, N, and the desired 
signal g(é). Determine the time samples {q,} of q(¢). Second, determine 
H(f) from transfer functions of the transmitting filter, the transmission 
medium, and the receiving filter. Third, compute the elements y,,; 
of y from the equation 


Ux.c = Af ie | H(f) |? cos [2rf(h — 2)To] df. (71) 


Compute the elements a,,; of y-* from y,,; . Fourth, compute the terms 
|u’y ‘u| and u’y~‘yv in equations (64) and (69) using the explicit equa- 
tions 


[vy uw | = [(B. — B.) + 4B:], (72) 
wy v= B, +B, (73) 
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where 
~~ Bi ete a) 
-> Ds a, sSase s (75) 
-> = On, Ens (76) 
i= Do Pa "| HG) | cos [n(f) + 2xf(kPo — Pa, (77) 
c= DS af | HO isin in) + 2efo — Olaf. — (78) 


Finally, compute &,;, from equation (64) and &j,4 from equation (69). 


VIII. COMPUTER SIMULATIONS 


We now apply the previous results to a single-sideband partial- 
response voiceband data communication system using a five- or nine- 
tap transversal equalizer. Since the results are largely similar, we shall 
describe only the five-tap case. 

Two computer programs have been written for this system. In the 
first program, 6,3, and Sina are computed using the formulas in the 
previous section. In the second one, Snin and &in4 are obtained using 
the method of steepest descent. The results of these two programs 
are described separately in the following subsections. 


8.1 Comparison of &nin aNd Sing Using the Explicit Formulas in Section 
VII 


Consider the voiceband data communication system described in 
Ref. 8. The desired signal g(¢) of such a system is the class IV partial- 
response signal,’ that is, 


Q*r sin Qaf(t — to) 


g(t) = [aft — i) = a2’ (79) 


where a time delay ¢é) is included to take into account the time delay 
in the channel. From Ref. 8, 


fo = 1200 Hz. (80) 
Hence, 7) = 1/2f) = 1/2400 seconds. We shall consider a transversal 


1090 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1970 


equalizer with five taps, that is, N = 2. It has been defined that H(f) 
is the Fourier transform of y(t) when p = 0. The amplitude and phase 
of H(f) have been denoted by | H(f) | and n(f), respectively, that is, 


A(f) = | H(f) | exp [in]. (81) 


It is assumed that the amplitude and phase distortions in the filters 
are negligible compared with those in the transmission medium. Con- 
sequently, | H(f) | and n(f) can be determined from amplitude and 
phase characteristics of the transmission medium (which is a voice 
channel in this case). From Ref. 10 and Tig. 4 of Ref. 8, a typical 
| H(f) | is found to be 


| H(f) | = [sin 247 f]10°°7° “7”, ee a (82) 


The component sin 277pf in equation (82) represents the desired am- 
plitude characteristic of the class IV partial-response signal, while the 
term 10°°7°%"?? represents typical amplitude distortion in a voice 
channel. The delay characteristic of five links of K carrier shown in 
Tig. 24 of Ref. 11 is representative of the delay characteristic of a voice 
channel. Therefore, it will be used here and 


n(f) = 9.89 sin [2r(f + f.)-0.00019 — 2.203], OSf < fy. (83) 


Constant phases and time delays in the transmission medium and the 
filters are omitted in writing (f) because their values are not available 
and because they only change the phase and time origins in the com- 
putations. However, such an omission makes it impossible to determine 
the phase p, of the received carrier pilot because the carrier pilot does 
not travel the same path as the signal. The signal is transmitted through 
the transmitting and receiving filters, while the carrier pilot is trans- 
mitted outside of these filters (these filters theoretically should have 
infinite attenuations at the carrier frequency). Furthermore, the carrier 
pilot is recovered at the receiver through a separate narrowband filter 
or a phase-lock loop, while the signal is demodulated and passes through 
a low-pass filter. With these differences and without detailed phase 
characteristics of the filters, it is not possible to determine p, here. 
Therefore, we shall simply leave p, and Ap as variables and compute 
the variation of &;,¢ With Ap. (It should be noted that Ap may assume 
small values in some real systems.) 

The variation of &;,3 with Ap has been determined for various values 
of tj. The curve obtained at f; = —27', is typical and is presented 
in Fig. 4. The value of &,i, 1s also indicated in the figure. It can be 
seen that &,,q can be as large as 0.034, while &,;, is only 0.0014. Thus, 
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the mean-square error can increase 0.034/0.0014 = 24.3 times if the 
demodulating carrier phase p is not set properly. Note that this large 
increase is obtained for a five-tap transversal equalizer. A similar 
result has been obtained for a nine-tap transversal equalizer. These 
results demonstrate that the mean-square error depends critically on 
the carrier phase setting when the number of equalizer taps is not large. 
(It should be noted, however, that when a large number of taps are 
used, the mean-square error will not be sensitive to the carrier phase 
setting.) 


8.2 Mean-Square Errors Obtained by Method of Steepest Descent 


A computer program has been written to simulate the system de- 
scribed in Section 8.1. The receiver parameters are adjusted by the 
method of steepest descent described in Section IV. Five hundred test 
pulses are used in each training period. The initial setting of the center 
tap é€) is unity, while the initial settings of the other taps are zero. The 
parameter t) is fixed at —27') as mentioned in Section 8.1. The mean- 
square errors obtained with various initial settings of p are shown in 
Fig. 5. 

The points marked by “X” in Fig. 5 are obtained by adjusting p 
and E jointly using a, = 0.523 and 6, = 0.1 for all k. For instance, 
point A is obtained by initially setting p to 40 degrees above the optimum 
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Fig. 5—Mean-square errors obtained by method of steepest descent. 
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p and then adjusting p and E jointly by the method of steepest descent. 
It can be seen from these points that the mean-square error & always 
converges to the value of &,;, determined in Section 8.1. This demon- 
strates the fact that 6:1, can be obtained by jointly setting p and E. 

The other points marked by circles are obtained by fixing p and ad- 
justing E only (a, = 0 and 8, = 0.1 for all k). For instance, point B 
is obtained by initially setting p to 40 degrees above the optimum p and 
then adjusting E only by the method of steepest descent. It can be seen, 
by comparing these circled points with the value of &;,4 in Fig. 4, that 
the mean-square error & converges, as expected, to Sina. 

Only 500 test pulses are used in each training period because & 
converges within this period in all cases. For instance, the convergence 
of & to the values shown by points A and B in Fig. 5 are illustrated, 
respectively, by curves A and B in Fig. 6. It can be seen that € converges 
rapidly to the final values. 


IX. SUMMARY AND CONCLUSIONS 


It is shown in this paper that in single-sideband systems the trans- 
versal equalizer and the carrier phase can be set jointly by the method 
of steepest descent to minimize the mean-square equalization error. 

The system is analyzed and a receiver structure is developed. The 
receiver structure is theoretically as simple as a conventional one. 
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Fig. 6—An illustration of convergence. 
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A well-known problem of the method of steepest descent is that the 
function to be minimized may converge to a local minimum instead 
of to a global minimum. To prove that this troublesome problem does 
not arise here, the variation of the mean-square error in the parameter 
space is analyzed. Exact locations of the stationary points in the param- 
eter space are determined and the classifications of the stationary points 
are obtained. It is shown that the mean-square error has only global 
minima and saddlepoints, but not local minimum or maximum. Thus, 
it is not possible for the adjustments to converge to a local minimum, 
regardless of the initial parameter settings. This completely eliminates 
the problem. 

It is also shown that two adjacent global minima (or two adjacent 
saddlepoints) are separated by 180 degrees in demodulating carrier 
phase, while a global minimum is separated from its adjacent saddle- 
points by 90 degrees in demodulating carrier phase. From this, a test 
is described to determine whether a global minimum or a saddlepoint is 
reached by the adjustments and to correct the settings if a saddlepoint 
is reached. This test may not be necessary because both previous ex- 
perience and theoretical considerations have shown that the method of 
steepest descent inherently stays away from saddlepoints. 

The choice of the step sizes of the adjustments is also considered. 
There are methods for determining the step sizes; however, they require 
complicated computations. As an alternative, it is shown in this paper 
that the step sizes can be easily determined if the equalizer and the 
demodulating carrier phase are adjusted in different steps (the steps 
can be alternated in any manner). 

Closed-form expressions of the joint optimum parameter settings 
and of the corresponding minimum mean-square error are obtained for 
computation of system performance. For illustration purposes, a single- 
sideband data communication system using a five- or nine-tap trans- 
versal equalizer is simulated on a computer. Both theoretical and simu- 
lation results show that the equalization error can increase by ten times 
or more when the carrier phase is not properly set. These demonstrate 
that when the number of equalizer taps is not large, the equalization 
error depends critically on the carrier phase setting. The computer 
simulation also verifies the theory that the equalization error can be 
minimized by using the joint method described in this paper. 
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APPENDIX A 


Classification of Stationary Points 


In this appendix, we prove the statements that: 


(i) F is positive definite when p and E are given, respectively, by 
equations (45) and (46), and mp, in these equations is an even integer; 
(it) F is indefinite when p and E are given, respectively, by equations 
(45) and (46), and mp, in these equations is an odd integer; and 
(227) All the local minima of &) are equal. 


Consider the first statement. Let Q be the set of all h except h = 0. 
Let & be decomposed into two disjoint sets Q; and 2, , where Q, contains 
the h’s with h, # 0, and Q, contains the h’s with h, = 0. Consider first 
Q, . Since hy ~ 0 in Q, , equation (53) can be written as 


PF = 2h a * fo(t) — q()} dt + 2h? fe * a(t)s(t) dt — 2h / ” (8) dt 


(84) 
where 
N 
a(t) = Do eet — nT, — NT») (85) 
n=—-N 
and 
? Roantn 
= n= -—-N to N. (86) 


n h, ? 


For the sake of brevity, we shall denote the entire right side of equa- 
tion (45) by po and the entire right side of equation (46) by E,. For 
any function G, the symbol [G],, denotes the value of G evaluated at 
p = po, the symbol [G],,,~, denotes the value of G evaluated at p = po 
and E = E,, and the symbol Min G denotes the minimum value of 
Gin Q, . These symbols and notations may be used jointly. For instance, 
Min [G],,,z, denotes the minimum value of [G],,,5, in Q . 

Since h; ~ 0inQ,, [F],,.2, > 0 in Q, if and only if [F/2h7],,.5, > 0 
in Q,, or if and only if Min [F/2hi],,.n, > 0. From equation (84), 
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Min lel ee er | ; [o(t) ~ (FP al 
+ | [aoe ul 
- | [ «% at). (87) 


In writing the above equation, we have used the fact that o(é) is inde- 
pendent of E, s(¢) is independent of h, and q(t) is independent of p, E, 
and h. 

Consider the first term on the right side of equation (87). From equa- 
tion (85), c(t) is a function of #(¢), where ~ indicates Hilbert transform. 
It can be shown from equations (7) and (19) that 


[7]. ao ly¥@)oo+nre . (88) 


Now we have 
Min | | [o(t) — g(t) ar| 
"eS io) N . 2 
Min i | el¢(t — nT, — NTo)\,. — x | dt, 
—30 N 


n=—s 


io) N 2 
Min [ bs enly(t = nT a NT 0) |o.+272 a «(| dt, 


n=—N 
0 N 2 
= {tin [ b> ey(t — nT, — NT,) — a | at} (89) 
00 n=—-N Potm/2 ° 
Note that equation (88) is used in the second step above. Since the 
term ees v ely(t — nT, — NT) in the last expression above is similar 
to s(t) in equation (8), the whole term in the { } in the last expression 
above is equal to &) minimized with respect to E. This proves that 
Min [f%., [«() — q(t] dé],, is equal to &) minimized with respect to E 
and evaluated at p = py + 2/2. It can be easily shown from equations 
(21) and (22) that & minimized with respect to E is equal to 
1/2fol Ue--» a — V'y~‘v]; hence, 


Min | a ; [o(t) — g(OP al 


a uathe eae ae ees 
= 2, 2 de fp [v’y Viootx/2 ’ 


Lae 1 : eae were! 
= be > & —s> Lu’y vy — exp (—j2p0)u'y a — exp (j2p0)v/y”v], 
fo k 2fo 


; (90) 
where equation (33) has been used in the last step. 
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We have evaluated the first term in equation (87). Consider now the 
second term in equation (87). From equations (20), (23), and (26), 


[ ic o(t)s(t) dt = x VE, (91) 


Substituting equations (33), (45), and (46) into equation (91) and sim- 
plifying, we obtain, after some steps, 


i as aul E =; U ‘v v 1 ee oa exp (— j2po)’ y ‘yu 


1 ’ - 
+ a exp G2p0)v’y vy. (92) 
The last term in equation (87) is 
[ toa-5 > a. (93) 


Substituting equations (90), (92), and ie into (87) and canceling out 
the terms having opposite signs, we obtain 


_[F ee : 1 
Min |# | = exp (—j2p.)u'y'u + = exp (j2po)v’y'v. (94) 
7 A ri O fo 


Note that so far mp, can be any integer. Let | u’y~’y | denote the magni- 
tude of u’y~‘u. It can be shown from equation (44) that when mz, is 
an even integer, the right side of equation (94) is equal to 2/f.| u’y‘u |. 
Hence, when m, is an even integer (including zero), 


. Lue d= 1 
Min El ig ape |u Yy wu | . (95) 


Since | u’y~’u | # 0, equation (95) shows that Min [F/2h7],,,5, > 0. 
Thus, [F],,,.2, > 0 in Q, when mp is an even integer. 
Now consider Q,. Since h, = 0 in Q,, equation (53) is reduced to 


F=2 dl : ES ee ee ae wr.) | dt. (96) 


n=—N 


Since y(t — nT) — NT,),n = —N to N, are linearly independent and 
hz to hey+2 cannot be all zero in Q,, the integrand in equation (96) 
cannot be zero for all t. Hence, F > 0 in Q,. This implies, of course, 
[Foo x6 > O0mQ,. 

We have shown that when m, is an even integer, [F],,.5, > 0in 
and 2, . Hence, when mz, is an even integer, [F],,,z, > 0 for all h except 
h = 0. This proves the first statement at the beginning of this Appendix. 


EQUALIZATION AND CARRIER ACQUISITION 1097 


Now we prove the second statement at the beginning of this Appendix. 
Note that the derivations from equations (84) through (94) hold for 
all integer m,. It can be shown from equation (44) that when mp, is 
an odd integer, the right side of equation (94) is equal to —2/fo| u’y~*y |. 
Hence, when m, is an odd integer, equation (94) becomes 


: te = 2 Nyy} 
oa Eas ay ee ls @?) 
Since | u’y~*y | + 0, Min [F'/2h7],,.5, < 0. Thus, [F],,,z, can be negative 
in Q, (and hence in Q). It can also be positive in Q (for instance, [F],,, x. 
> 0inQ,). Therefore, F is indefinite when p and E are given, respectively, 
by equations (45) and (46), and m, in these equations is an odd integer. 
This proves the second statement. 

Finally, consider the third statement. We have shown that F is posi- 
tive definite and & has a local minimum when p and E are given, re- 
spectively, by equations (45) and (46), and m, in these equations is 
an even integer. Substituting equations (45) and (46) into equation 
(22) and letting m, be an even integer, we see that the local minimum 
of & is 


i< 1 ies 1 _ 
& = Bf a, di — 7 ly’y en] — fy v. (98) 


Since the right side of equation (98) is independent of mp, all the local 
minima of & are equal and hence are all global minima. This proves 
the third statement at the beginning of this Appendix. 


APPENDIX B 


Discussion of Existing Method 


A number of methods are described in Ref. 5 for determining the 
proportional constant in steepest descent adjustments. It has been 
pointed out that these methods can be used to solve certain nonlinear 
equations on computer.” 

These methods require elaborate computations. For example, con- 
sider the possibility of determining a, and 6, using the theorem on page 
31 of Ref. 5. Since it is assumed in that theorem that a single propor- 
tional constant is used for all parameters, we change the scale of p 
or E such that it is also appropriate to use a single proportional constant 
in our case. After this is accomplished, we can use a single constant a, in 
equations (56) and (57). To determine a, , define 
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AE, 


of[ Go 20] + [oe 29] | 


Choose a constant 6 in the range 0 < 6 S$ 4. Compute g(p,, E,, 1). 
If g(p,, Ex , 1) < 6, choose a, < 1 so that 6 S g(p,, E,, a.) S1— 6 
(it has been shown that this choice is always possible). If g(p, , E, , 1) 
= 6, choose a, = 1. 

It can be seen from this description that the method requires elaborate 
computations and a, must be determined on a trial and error basis when 
g(p, » Ex, 1) < 6. It is therefore difficult to use this method in a receiver. 





g(px » Ex , a) = - (99) 


APPENDIX C 


Proof of Proposition 2 


In this Appendix, we prove Proposition 2. Consider first k « K,. 
Since a, = O whenke &, , we have p,.; = p, and 


A& = E(px , Ex) — &(px , Ex), 


= ElyE, — 2Ev+ >> gi 
h 


=—0 


aa Eee = 2Ef41V + ps “| : (100) 
I’rom equations (57) and (88) 
E.41 = E, — 8,(2yE, — 2v). (101) 


Substituting equation (101) into equation (100), we rearrange the 
resulting equation into the form 


A& = 48,(yE, — v)'[I — B.yl(yE. — v) (102) 


where I is the identity matrix. Let the eigenvalues of y be denoted, in 
the order of decreasing magnitude, by \;, 7 = 1 to 2N + 1, so that 


Ay Ag oto Se Mewen- (103) 
It can be shown that y is positive definite; hence, 
A; > 0, 4=1to2N +1. (104) 


Let u;, 7 = 1 to 2N + 1, be a set of orthonormal eigenvectors of 
y. Let 


Q = [ute ++ Uewveil. (105) 
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That is, the 7th column of Q is u; . From well-known matrix properties, 
y = QDQ’ (106) 


where D is a diagonal matrix whose 7th diagonal element is \; . Further- 
more, 


Qed =L. (107) 
Substituting equations (106) and (107) into equation (102), one can 


write 


2N+1 


A& = 46; > (1 — BA) (VE, — v)’u,]’. (108) 


If 6, satisfies equation (62), we can write 


1 a 7, BrAr = ea > 0. (109) 


Then, from equations (103), (108), and (109), we have 
2N+1 


A&, = 46,P, » [(yvE, — v)’u,]’. (110) 


i= 


It can be shown from equations (88), (107), and (105) that 
Fs) , a 2N+1 5 
E E(px , By | E (px 2) | =4 a. [((yE, — v)’u,]’. (111) 


Comparing equations (110) and (111) gives 


fe) 7 
A& z= ap 4, E(px , B) | 2. E(p, ’ B) | (112) 


Now we have the conclusion: 


Conclusion 1. If equation (62) holds, A&, is bounded below by equation 
(112) which shows that A&, is positive and approaches zero only when 
the partial derivatives (0/dE)&(p, , E,) approach zero. 

The above is fork « K, . Next we consider k « K,. Since 6, = 0 when 
ke X,, we have E,,, = E,. From equations (58) and (22) 


A& = &(px ’ E,) “= E(pi+t ’ E,), 


—2EL(v, = Viz) ’ 


(113) 


I 


where 
Vi = Xp (—Jpx)u + exp (Jpx)v, (114) 
Viwt = €XP (—Jpzir)u + exp (Jprss)v. (115) 
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From equation (56) and the two equations above, 


Ve = View = — COS | a E(px , B) |flexp (—jpx)Uut exp (Jpx)v] 


— jsin E = E(px B) |lew (—jo.)u — exp (jpx)y]. 


(116) 


The vectors u and v are defined in equation (37) and their elements 
are defined in equations (84) and (85). It can be seen from equations 
(84) and (85) that the elements of u and v are comple numbers. Let 
u be written as 


y= E+ 5 (117) 


where the elements of ~ and ¢ are real numbers that can be determined 
from equation (34). Comparing equation (35) with equation (34), we 
see that the elements of v are complex conjugates of those of wu. Hence 
v can be written as 


v= &— 50. (118) 


Substituting equations (117) and (118) into equation (116) and then 
substituting the resulting equation into equation (113), we obtain after 
some steps, 


A&, = 1{ co E 7 E(x, B) | a bree cos p, + Ez¢ sin py] 


— 4sin E E(pe ; B) | [Eff cos p, — Eze sin p,]. (119) 


From equations (86), (117), and (118), 


0 ; ; 
Op E(px ’ E,) 2jE;(exp (—Jpx)u — exp (ipx)v), 


I 


2jEi(—2j sin p= + 2] cos p,€), 
4([EzE sin p, — Eff cos p,]. (120) 


It is clear from equation (56) that, if (0/dp)&(p, , E,) = 0, then pra: = 
p, and from equation (115), A&, = 0. So we need to evaluate A&, only 
for (0/0p)&(p, , E,) ¥ 0. It can be seen from equation (120) that, when 
(0/0p)&(p, , E,) ¥ 0, ELE and E/¢@ cannot be simultaneously zero. Hence, 
we can define a quantity 
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1 Eig 


o = tan Br (121) 


and a quantity 
C = [(Bie)” + EO"? > 0. (122) 


From equations (121) and (122), equation (120) can be rewritten as 
0 ; 
ap &(px ’ E,) = 4C sin (p:, oo ¢). (123) 


Substituting equation (123) into equation (119), using equations (121) 
and (122), and rearranging, we obtain, after a number of steps, 


A&, = 4C{cos [p, — ¢ — 4Cc, sin (p, — ¢)] — cos (p, — ¢)} (124) 
or 
A& = 8C sin [Px —-o-— 2C'a, sin (px = ¢)| sin [2Cax, sin (px = o)]. (125) 


It may be assumed that 0 S p, — ¢ S 27 (a factor of 27 or its multiple 
can be dropped). Since (0/0p)&(p, , E,) ¥ 0, p, — ¢ cannot be 0, z, or 
2m [see equation (123)]. Hence, we have either 


O<p—¢<7 (126) 
or 
TS p—-Oo< Qn. (127) 


We shall consider equation (126) first. 

As can be seen from equation (56), the constant a, determines the 
size of each adjustment (note that a, > 0 is required so that the ad- 
justments will be made in the negative gradient direction). We wish 
to determine the permissible range of a, , that is, to determine a number 
6 such that A&, > 0 for every a, in the range 0 < a, < 6. It can be 
shown from equations (124) and (126) that 6 can only be as large as 
1/2C; hence, the permissible range of a, is 


J 
0< an < IC (128) 
From equations (128) and (126), we obtain 
0 < 2Cc, sin (p, — ¢) <1. (129) 


It can be easily shown from equation (129) that 


Pen ee 2 MCasintos 80. 430) 
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It can be shown from equations (126) and (128) that [p, — @ — 2Ca, sin 
(p, — #)] must be in the range 


0 < [or — ¢ — 2Ca, sin (o, — ¢)] <5 (131) 
or in the range 
5 < [px — & — 2Cax sin (p, — 6)] < x. (132) 
When equation (131) holds, 
sin [ox — ¢ — 2Ca sin (o, — $)] = = [ov — 6 — 2Caxsin (pi — 4)]. 


(133) 


Combining (133) and (130) and using equations (125) and (123), we 
obtain, after some simplification, 

2a, () ‘ 

A&, > =F [2 — 4Cax] ap (px , Ex) | ° (134) 


It can be shown in a similar manner that when equation (132) holds, 


SCai, | 9d ‘ 
A& > — |Z Bes B) | (135) 





It has been shown from equations (126) and (128) that either equation 
(184) or equation (1385) holds. In a similar manner, it can be shown 
from equations (127) and (128) that either equation (184) or equation 
(135) holds. Thus, we have the conclusion: 


Conclusion 2. When equation (128) holds, either equation (134) or 
equation (135) holds. Physically this means that, when equation (128) 
holds, A&, is positive and approaches zero only when (0/d0p)&(p;,, E,) 
approaches zero. 

It is not easy to determine whether equation (128) is satisfied in 
‘practice because the constant C depends on E, , ~, and ¢ and is some- 
what difficult to compute. Hence, we wish to replace equation (128) 
with inequality (63) in Proposition 2. 

From the definition of v, in equation (114) and from equations (117) 
and (118), we can write 


2Env, = 4[cos pJE.E + 4[sin p,JE;¢. (136) 


From equations (136) and (22), one can show that 
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‘Ge = & (px ; E,) = 4E/E COS px + 4E/T sin Pk (137) 
where 


L, = EiyE, + Do Gh. 


h=-0 


Note that the terms E/yE, , >0%_,, @ , ELE, and E/¢, in equation (137) 
are independent of p. Furthermore, equation (137) holds for all p,. 
Letting p, = 0 in equation (137), we obtain 


DL, — [&(px Ex) loro = 4E7é (138) 


where [8(p; , Ex)],,-0 18 &(o , Ex) evaluated at p, = 0. Since &(p, , E,) 
= 0 for all p, , we have from equation (188) 


DL, 2 4Ezé. (139) 
It can be shown from equation (22) that 
[S(px , Ex)]oxn0 < 2Ly . (140) 
From equations (140) and (188), 
—Iy, < 4ELE. (141) 
From equations (139) and (141), 
(Ej) S eli. (142) 


We have obtained equation (142) by letting p, = 0 in equation (137). 
If we let p, = 7/2 in equation (137), we obtain, in a similar manner, 


(EiQ)* = eli. (143) 
From equations (142), (148), and (122), 
1 
Cs 302)! Ti . (144) 


Using sampling theory, we can verify that 


L, = apf [7 s°(2) at|. + 2fo [ ¢ (1) dt (145) 


where [f%,, s(t) dé], is [%,, s°(t) dt evaluated at E = E,. 
It can be seen from equations (144) and (145) that, if equation (63) 
holds, equation (128) is satisfied. Thus, we have the conclusion: 


Conclusion 8. When equation (63) holds, equation (128) holds and, 
from Conclusion 2, either equation (134) or equation (135) holds. Hence, 
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when equation (63) holds, A&, is positive and approaches zero only when 
(0/dp)&(p, , Ex) approaches zero. 

Now, let us summarize the results presented in this appendix. It has 
been shown that when equation (62) holds in ®, , A&, is bounded below 
by equation (112) (see Conclusion 1). It has also been shown that when 
equation (63) holds in K,, A&, is bounded below by either equation 
(134) or equation (135) (see Conclusion 3). It is seen from the lower 
bounds in equations (112), (134), and (135) that A&, is positive and 
approaches zero only when the partial derivatives approach zero. Since 
*, and ®, each contain an infinite number of k’s and since the mean- 
square error & cannot reduce without bound, & must converge to a 
stationary point. This proves Proposition 2 in Section VI. 
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A General Approach to Twin-T Design and 
Its Application to Hybrid Integrated 


Linear Active Networks 


By G. S. MOSCHYTZ 
(Manuscript received January 2, 1970) 


In this paper we approach twin-T design with a view to controlling the 
sensitivity of the transmission zero with respect to component variations, 
according to criteria that are of particular interest in the design of hybrid 
integrated linear active networks. We give design examples and derive 
conditions that relate null depth and component characteristics with ex- 
pected zero displacement in the s-plane. 


I. INTRODUCTION 


In 1934, H. W. Augustadt invented the twin-T network while carrying 
out investigations for an economical rectifier filter for phonograph 
amplifiers.’ The two main fields of application of the twin-T network 
were introduced in 1938 by H. H. Scott who discussed its uses as a 
feedback network to obtain highly selective amplifiers and stable 
oscillators.” In the following years the circuit was thoroughly analyzed 
in the unloaded state* ° and, later, in the loaded state when driven from 
a nonideal voltage source.’ Consideration was also given to the net- 
work’s selectivity properties and to the effects of loading and network 
asymmetry.'’-"” In the early 1960s, a somewhat new application was 
introduced for the twin-T when synthesis methods based on root locus 
techniques were developed to employ the twin-T as a compensation 
network in de servo systems.’*~*’ These investigations were limited to 
the symmetrical twin-T with fixed source and load resistances. They 
were later expanded to include wide ranges of source and load imped- 
ances’ and to provide prescribed pole-zero locations” using parameter 
plane techniques. 

Recently, with the advent of linear integrated circuits, interest 
in the twin-T network has been revived yet again, this time by network 
theoreticians attempting to generate, by RC network synthesis tech- 
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niques, a still wider range of pole-zero configurations than had hitherto 
been possible.’*~”* At the same time, numerous methods of active RC 
filter synthesis were developed that rely on the basic frequency char- 
acteristics of a twin-T network or modifications thereof, to provide the 
required filtering properties.” °° These methods depend, for their fre- 
quency stability on the stability of the twin-T network. To ensure a 
very high degree of stability the twin-T has been realized by tantalum 
thin film components and then combined with silicon integrated active 
circuits to produce hybrid integrated filter networks.** In applications 
of this kind null frequency and null depth tuning procedures become 
very critical, particularly because thin film resistors can only be ad- 
justed in the increasing direction; furthermore the null characteristics 
(gain and phase) become more important than in more conventional 
applications, and adjustments of these characteristics should not only 
be possible but also simple. It is with respect to these problems that the 
twin-T network is reexamined once again here. 

The requirement that the six components of a twin-T network provide 
a perfect null, that is, a pair of imaginary zeros, at a particular fre- 
quency imposes only two design constraints on the network. A third 
results from the impedance scaling factor chosen for the network. Thus, 
three parameters remain to be chosen by whatever criteria seem most 
important for a given application. Most often circuit simplicity domi- 
nates this choice, resulting in the symmetrical twin-T. In other instances, 
practical considerations requiring that either all the resistors or all the 
capacitors be equal will determine the choice. In those cases where the 
network is synthesized to provide other than standard pole-zero loca- 
tions, no choice exists at all, since all the network parameters are 
generally accounted for. 

In this paper, we select the three unconstrained design parameters 
in such a way as to control the null characteristics of the twin-T ac- 
cording to criteria of particular importance in the design of linear active 
networks. In such networks the twin-T is generally part of a positive 
or negative feedback configuration whose closed loop poles are closely 
tied to the open loop zeros on the jw-axis. The latter are generated by 
the transmission null of the twin-T network. The higher the Q of the 
network, the closer the tie between the closed loop poles and the open 
loop zeros and, consequently, the more critical the sensitivity and sta- 
bility of the twin-T transmission null. To obtain a measure for both, 
the zero sensitivity functions for the commonly used and for the general 
twin-T configurations are derived first. By selecting the three design 
parameters remaining in the sensitivity functions of the general twin-T 


TWIN-T DESIGN 1107 


appropriately, it is found that a relatively wide range of sensitivity 
criteria can be met. Some of these are useful in contributing to the 
overall stability of an active feedback network incorporating a twin-T. 
Others are of interest in considerations pertaining to useful frequency 
and null depth tuning strategies in the vicinity of a perfect twin-T null. 

To guarantee stability, conditions are also derived here that prevent 
the twin-T transmission zeros from drifting to the right half s-plane. 
This implies a maximum permissible null depth of the twin-T that can 
be expressed in terms of the twin-T design parameters and the tempera- 
ture and aging characteristics of its components. 


II. CIRCUIT ANALYSIS OF THE GENERAL TWIN-T 


The voltage transfer function of the general twin-T shown in Fig. 1 
is given by the ratio of two third-order polynomials, namely 


B-TO-Dotipietieee 
where 
a, = R3(C, + C,), (2a) 
a2 = R3(Ri + RCC, , (2b) 
az; = R,RR3C1C2C3 , (2c) 
b, = R3(C, + C.) + RC. + Ri(C. + Cs), (2d) 
b. = Rs[RiC3(C, + C.) + (Ri + R2)CiC2] + RRC, , (2e) 
b; = RiRR2CiC2C3 = as , (2f) 
and 


Ss =oa-+ jo. 


The null, or transmission zero, of the twin-T is defined by the roots 


Ry Ra 


Cy Ce 


Fig. 1—General twin-T network. 
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of N(s) on the imaginary axis. Thus for 
N(s) |.-10 = 0 (3) 


the null frequency or conjugate complex transmission zeros 2,2 = 
jwy are obtained. 
Using the substitutions 





CC, 
C~, i. Ci 4. CG ’ (4a) 
C, = C1 + C, ’ (4b) 
R=kh, +R, (4c) 
RR, 
Rk, = R, a R, ’ (4d) 


the following two conditions for the twin-T null frequency result from 
equation (3) 


1 
2 ee eminent, 
ov = BRCC, (5) 
and 
Cy _ Cy, 
R; ~~ R, (6) 
These can be combined as follows 
2 1 = 1 
on RRC,C;  RR3C,C2 (7) 


Thus, for a perfect null the transmission zero of each of the two bridged- 
Ts obtained by disconnecting R; and C3, respectively, of the twin-T 
(see Fig. 2) must be equal. This fact has been used to develop a 2-step 
tuning method for the twin-T.” Substituting equations (5) and (6) 
into equation (1) gives the transfer polynomials of the nulled twin-T 


N(s) = tise (s a) (s° + wy) (8) 


and 


ze iG + (8R.c. + a) + wi. (9) 


The open-circuit impedance matrix of the perfectly nulled twin-T 





D(s) = a (s + 
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R, Ro 





Fig. 2—Bridged-T networks derived from general twin-T shown in Fig. 1, whose 
natural frequencies coincide with twin-T null frequency. (a) R; disconnected; 
(b) Cs disconnected. 


is given in Appendix A. The voltage transfer function follows from equa- 
tions (8) and (9), namely 


2 2 
Ty (3) = § ~ On (10) 
gs +s t+ ox 
Qn 
where wy is given by equation (7), 
sep MON Nee 
dv = 2on wv(R, C3 + RC.) (11) 
and 
2on => wyltiCs -++ u : : (12) 








RC, a RC, RC, 


Inspection of equations (8) and (9) shows that the two third-order 
polynomials of the transfer function of a general twin-T are simplified 
by one degree due to pole-zero cancellation when the conditions for a 
perfect null are satisfied. It is shown in Appendix B that even when the 
twin-T null is not perfect (that is, the transmission zeros are not on but 
only close to the jw-axis), this pole-zero cancellation still takes place, 
provided that 2,C, = RC. . 

The most frequently used twin-T is structurally (and electrically) 
symmetrical (see Appendix A). For this case (see Fig. 3a) R; = R, = 
R, R; = R/2,C, = C, = C, C3 = 2C, and the coefficients of equation 
(10) are wy = 1/RC, 207 = 4/RC and gy = 4. Another commonly used 
version of the twin-T is the potentially symmetrical configuration (see 
Appendix A). This is obtained by impedance scaling one-half of the sym- 
metrical twin-T by some factor p. The resulting twin-T elements are 


1110 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1970 





R R 
Cc Cc 
R/2 2c 
(a) (b) 
R R 2R 
Cc C/2 C Cc 
R 3c R/3 Cc 
(c) (d) 


Fig. 3—Frequently used twin-T configurations. (a) Symmetrical; (b) Potentially 
symmetrical; (c) Equal resistors; (d) Equal capacitors. 


(see Fig. 3b) Ry = R, R, = pR, Rs = pR/(1 + p), Ci = C, Cz = C/p, 
and C; = (1 + p)C/p. The coefficients of the transfer function [equation 
(10)] for this case are wy = 1/RC, 2on = (2/RC)(1 + p)/p and qn = 
3po/(1 + p). Notice that for the extreme asymmetrical case for which 
p > 1, gy takes on its maximum value of 0.5. 

Sometimes it is useful to make all the resistors of the twin-T equal. 
This enables one to gang three variable resistors in order to vary the 
null frequency. The twin-T elements are then (see Fig. 8c) R, = R. = 
R; = R,C, = C, C, = C/2, C3; = 3C, and the coefficients of the transfer 
function are the same as those of the symmetrical twin-T. Similarly, if 
the three capacitors are to be made equal for ganging or other purposes, 
the twin-T elements are (see Fig. 3d) R, = R, R. = 2k, Rs = R/B. 
Here again the coefficients of the transfer function are the same as those 
of the symmetrical twin-T. 


Ill. SENSITIVITY OF TWIN-T NULL CHARACTERISTICS TO COMPONENT 
VARIATIONS 


The null or zero sensitivity of the twin-T to variations of any com- 
ponent x gives a measure for the degree of change of the transmission 
characteristics in the vicinity of the twin-T null frequency as a result 
of variations of a component x. Referring to the complex frequency 
plane, the zero (or pole) sensitivity gives a measure for the zero (or pole) 
displacement due to an incremental change in the value of the component 
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z. It is defined by*® 


bo 202 
ie dx/x 


where z is the complex null frequency of the network. 

The zero displacement dz in the s-plane has a real and an imaginary 
component. It defines a vector or the direction in which a zero (or pole) 
travels from its initial location with incremental changes of a com- 
ponent «. Since x and therefore dx/x must be real, the zero (pole) 
sensitivity defines a vector in the same direction as the zero (pole) 
displacement dz. Herein lies the importance of knowing the root (that is, 
pole or zero) sensitivity of a network since it provides insight into the 
stability of a system with respect to the component x that is expected 
to vary the most. It also provides information relevant to network 
tuning since it relates adjustments of any component x to its effect on 
the roots of the transfer function. Conversely, as we shall see later, a 
network can be designed to provide a prescribed sensitivity between 
some parameter of the transfer function such as the displacement of a 
specific transmission zero and the variation of some preselected com- 
ponent x. The choice of sensitivity may be such as to result in a network 
that responds to a simplified tuning strategy or whose characteristics 
may be adjusted in a desirable way by the component zx. 

The most important characteristic of the twin-T is its behavior in 
the region of the frequency null. In the s-plane this behavior is char- 
acterized by the sensitivity of the transmission zero, which is initially 
located on the imaginary axis for a perfect null. The sensitivities of this 
zero, that is, 2 = jwy , to each of the six components of the general twin- 
T have therefore been derived in Appendix C and listed in Table I. 
In doing so it has been found useful to characterize the general twin-T 
by the following parameters 





(13) 














=a Ry (14) 
C2 
rs C. ae CG. , (15) 
and 
wo _ (RC) 
cae Wy 7 (Re) (16) 


d and » give a measure for the degree of symmetry of the series elements 
of the twin-T; a relates the series elements to the shunt elements. 


TaBLE ]—ZxERO-SENSITIVITY FUNCTIONS FOR A GENERAL NuLLED Twinx-T 


Fis Wy en, hd jon _ _____ Wy = : v 
sit" = pitts [1 : i(4 + aa) | sit = aft [1 »tile+2)| 


jon _ _ Gn aa Jl AL = POR, 2 ON. : (Lo) 
8k, = a0. + a) I itt + a(l | Se, ie a) E = ita = 7 S| 





























jon _ eens AWN _ ‘ TWN = QO) x sao rs G 

Se." = —o7 + oF Ut + 32) Se. say (1 i) 

where: 
ee Seen Ne _ _— _Fiky 
ov = BRCCs ~ RRsC.Cs Hescedearde tates, SG yea ape 

ee _— _ OC, , _ 
OES ae TF RG. Ces ora gs: “Ges Gyan’, 
Ri C; 
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A useful check for the validity of the expressions in Table I is that 
they must satisfy the following condition for the root sensitivity of a 
passive RC network if the relative resistor and capacitor changes are 
assumed to track** 


» She = DL Se, = =z. (17) 


Let us now consider the zero sensitivity of the commonly used twin-T 
configurations shown in Fig. 3. 


3.1 Symmetrical Twin-T 


This case is characterized by the parameters wy = 1/RC, a = 1, and 
\ = v = 0.5. The resulting zero sensitivity functions are listed in Table 
II, Part 1 and the corresponding zero displacements in the complex 
frequency plane are shown graphically in Fig. 4a*. This diagram also 
demonstrates the realization of the condition for passive RC networks 
given by equation (17). Therefore, by assuming tracking and equal 
but opposite temperature coefficients of the resistors and capacitors, 
temperature drift of the null frequency can theoretically be eliminated 
completely. If tracking of like component variations does not occur, — 
the drift displacement due to the symmetrical elements R, and FR, and 
of C, and C;, are identical. The displacements due to the shunt elements 
FR; and C; have approximately the same value but follow a somewhat 
less steep slope. Thus if the twin-T is being used in a feedback network 
to generate conjugate complex poles in the left half plane close to the 
jw-axis, the stability of the network will be more sensitive to drift in 
the shunt elements than to drift of those in series. 


3.2 Potentially Symmetrical Twin-T 


This case is characterized by the network parameters wy = 1/RC, 
a=1l,and\ =v = 1/(1 + p). The resulting zero-sensitivity functions 
are listed in Table II, Part 2. Since they depend on the symmetry coef- 
ficient p, some control on the sensitivity can be expected by this coeffi- 
cient. Table IJ, Parts 2a and 2b list the sensitivity functions for the two 
extremes, that is, when p is much larger and much smaller than unity, 
respectively. The corresponding zero displacements are shown in Fig. 
4b. These two complementary cases can be thought of as having evolved 
from the symmetrical case (Fig. 4a) by spreading out the displacement 

* As pointed out earlier, the root sensitivity function given by equation (13) 
defines a differential vector in the complex s-plane. It can be shown that this vector 
lies on the branch of the root locus with respect to a component zx that starts at z 


or, in other words, that the root displacement Az and the root sensitivity have the 
same argument. 


Tasue []—Zero-Sensitiviry FuNcTrIONS FOR CoMMONLY Usep Twin-T CoNFIGURATIONS 


1. Symmetrical Twin-T 


I 
lH 
R 
Il 
— 
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I 
I 
Il 
ne 
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I 
eH 


(R, = R, = 2R; = RF; Cy = C, = C,/2 = C): Ov = pO? Qn 
SitY = Sit” = =X (2 — $j) = 0.895 wy/—71980' | sit” = sit¥ = —=2 (4 + $j) = —0.395/71°30’ 


sun 


com 


1 — 9) = 0.854/—45° 
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ae (1 + 7) = —0.354/45° gig” = oe 


2. Potentially Symmetrical Twin-T 











Co “sth 5 ( | 1 p 
> Ov= ani Q=s ; @=1; ’\=~v=— 5 7 3 
Cc’ 2\1 1 : 
faerie e +p + p G+? 
jon _ ON Pp 2 +2) te ie (2 p 2) 
aan ‘cer T+ Se a ig ep 
sit = 2 ( 1 — ji +22) sign = 2 ( 1 + ji4 22) 


aa 


I 


__ On ‘ joN = Wn ee 
4 Ciara) Sc: 4 (1 — j) 


vITT 


0L6I Lsapnv—-Ator “IVNUNOL IVOINHOGL WHLSAS TIGG FHL 


TaBLeE II—Cont’d 











2A. Potentially Symmetrical Twin-T (S> 1): (p> 1): qy—: 
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2B. Potentially Symmetrical Twin-T (S«1): (p <1): 
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3. Twin-T with Equal Resistors 


Gy =] fe = he = Re Cy = 20. = 6/4 = 6): Oy = 








qa (2 + 3.57) = —0.276 wy/60° 


=a (1 + 57) = —0.3892 wy/T9° 





= a (3 — 4.57) = 0.416 wy./—56° 


SifX = git™ = 7S (1.5 — 5.57) = 0.44 wy/—75° So,” = 
- Ses” = 25 (8 + 2j) = —0.278 wy/34° SS 
Sc." 
4. Twin-T with Equal Capacitors 
Ghee = ohy = he, =] es = CH=): wy = 55 
sit* = 7 (2 — 3.57) = 0.276 wy/—60° sien 
sie) = (1 — 5j) = 0.392 wy/—79° Se 
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vectors corresponding to the series components R,, R,z and C,, C,, 
respectively, and leaving the vectors corresponding to the shunt com- 
ponents R; and C, unchanged. The noteworthy feature of these two 
extreme cases is that they both provide for null frequency adjustment 
(if only over differentially small frequency ranges) by a single com- 
ponent, namely by R, or C, when p > 1 and by R, or C, when p <1. 
Of the two cases, the former is preferable since it simultaneously pro- 
vides higher selectivity than a symmetrical twin-T (that is, gy ap- 
proaches its maximum value). More will be said about zero-sensitivity 
and its effect on network adjustability in Section IV. 


3.3 Twin-T With Equal Resistors 


For this case wy = 1/RC, a = 3,\ = 4, andv = 3. The corresponding 
zero-sensitivity functions are given in Table IJ, Part 3, and the zero 
displacements are shown graphically in Fig. 4c. Since the series resistors 
are equal in this case, the corresponding sensitivity functions are also 
equal. However, the sensitivity function of the shunt resistor is smaller 
in value and flatter in slope. Therefore, in order to shift the null fre- 
quency accurately, a high precision ganged 3-resistor potentiometer 
must be used whose resistor values track very closely. 


3.4 Twin-T With Equal Capacitors 


Here we have the design parameters wy = 1/RC, a = 3, \ = 3, and 
y = 3, The corresponding sensitivity functions are listed in Table II, 
Part 4. Since this case is the dual of the equal-resistor case discussed 
above, the displacement vectors are negative and conjugate to those 
of Fig. 4c. Generally high precision ganged resistors are more readily 
available than capacitors, so that for variable null-frequency tuning 
purposes this case is less practical than the preceding one. 


IV. NOVEL TWIN-T NETWORKS WITH PRESCRIBED TUNING CHARACTERISTICS 


In the preceding section, the null sensitivity of component variations 
was discussed with respect to the most commonly used twin-T configura- 
tions. In this section the expressions for the general nulled twin-T, that 
is, those satisfying only the null conditions given by equations (5) and 
(6), are reexamined in relation to the corresponding zero sensitivity 
functions listed in Table I. In particular we investigate how the remain- 
ing twin-T parameters that are not constrained by the two null condi- 
tions can be utilized to modify the dependence of the null character- 
istics to adjustments of certain twin-I components in such a way as to 


1118 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1970 








DECREASING Cj,C2 
DECREASING C3 


JON w er 
~y Soy No (4j)=jJon 


R; = Ro = 2R3=R 


-4}j 
Ci= Ca=C3/2=C (-4)) 
S = ¢+Jjw 
INCREASING R3 
(a) ¢ 
jo & ~-DECREASING Ca (C)) 
DECREASING C3 (C3) DECREASING Cy (Ca) 
= jon 
R; =R Cy=C 


R2=pR Co= C/p 


p {+p 
Rgece-R (oGet*£c 
It p p INCREASING 


p>>1 (p<<t) INCREASING R; (Ro) 
R3 (Ra) 
INCREASING 
| Ra (Ri) 
- 1 2 3 4 #5 
Co 
(b) 


Fig. 4—Zero displacements for frequently used twin-T configurations. (a) Sym- 
metrical; (b) Potentially symmetrical; (c) Equal resistors. 
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DECREASING C3 


DECREASING Cy 


INCREASING R3 


Ri,Ra 
Cy = C3C2 =C/2;C3 =3C 
3 
2 
1 
{234567 8 
o 
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satisfy various tuning strategies that are particularly useful in practical 
applications. 

The dependence of twin-T transmission characteristics in the vicinity 
of the null frequency on variations of any component 2 are essentially 
determined by the zero sensitivity functions listed in Table I. Design 
equations for twin-T networks providing a specified dependence of null 
characteristics on the adjustment of a particular component x may, 
therefore, be found by setting corresponding constraints on the zero 
sensitivity functions and solving the resulting equations for the twin-T 
components. Instead of designing a twin-T to a specified dependence of 
transmission characteristics in the vicinity of the null frequency to varia- 
tions of a given component xz, it therefore suffices for us to design a 
twin-T to a specified zero sensitivity with respect to x. 
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The expressions listed in Table I show that, after the null frequency 
wy has been specified, there are basically three parameters left to deter- 
mine in order to design a twin-T to a prescribed zero sensitivity. These 
are the frequency ratio a, the resistor ratio \, and the capacitor ratio v. 
A parameter that sometimes provides clearer physical insight into the 
design of a twin-T than the frequency ratio « is the ratio of series to 
shunt capacitors y, namely 


CG 


Y — C; = yp C,. (18) 
The two parameters are related by the expression 
: Fe = I ae) 


a has been plotted in Fig. 5 as a function of \ with the parameter y. 
The values of y for the common twin-T configurations are included in 
Table II. From the defining equations, the limits on the four twin-T 
parameters are 


0O<a<o, (20a) 
0O<y<o~, (20b) 
Ox <1, (20c) 
0<v<l. (20d) 


A fundamental characteristic of the twin-T is its ability to reject a 
narrow frequency band centered at the null frequency fy and to pass, 
substantially unattenuated, the frequencies on either side of this band. 
A useful parameter characterizing the selectivity of frequency rejection 
is the inverse damping factor gy [see equation (11)] also known as the 
pole Q. Physically qy is the ratio of the center frequency fy divided by 
the bandwidth at which 3 dB attenuation occurs* (see Fig. 6a). 

It is important, while examining the effects of the parameters listed 
in equation (20) on the zero sensitivity of the twin-T to keep an eye on 
their effect on the twin-T selectivity as expressed by gq, . Obviously, 
poor selectivity might be too high a price to pay for any set of controlled 
sensitivity functions. On the other hand, because the twin-T is a passive 
RC network, the selectivity factor gy only has a narrow range of realiza- 


* This definition is only accurate for unloaded twin-T networks such as those 
being considered here. For the case of a loaded twin-T with an unsymmetrical 
frequency response, it has been found more useful to define selectivity as the slope 
of the phase ¢ at the null frequency, that is, by r(wy) = dé/dolama y: 
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Fig. 5—Frequency ratio a as a function of \ and parameter y. 


bility as it is, that is, 
0 <q < 0.5 (21) 


whereby the value 0.25 is realized the most frequently, namely with 
the symmetrical as well as with the equal resistor or equal capacitor 
twin-T configurations. However even within the limited range given 
by equation (21), the difference in actual frequency selectivity can be 
quite significant. This is illustrated in Fig. 6b where twin-T frequency 
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Fig. 6—Twin-T selectivity. (a) Definition of gy; (b) Frequency response for 
various qgy-values. 


-response curves have been plotted as a function of the parameter qy . 
Expressing gy by the same parameters as are used for the zero sensi- 
tivity functions in Table I we have 


__ai-y)d—r) _ d—»)rGd — vF, 
Lag eae Ce OP aaa IE Ae Ve ET (22) 
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With equation (22) we can easily observe the effects on selectivity that 
any sensitivity constraints on the parameters \, v, a or y may have. 

After these preliminary remarks, let us now consider in what ways 
and by which criteria it would be useful in practice to control the zero 
sensitivity functions given in Table I. Due to the RC self-duality of the 
twin-T,** we need only consider either the resistor or the capacitor 
functions. Since both discrete and (thin film) integrated RC networks are 
generally tuned by variable or anodizable resistors, the zero sensitivity 
functions with respect to these will be examined. 


4.1 Frequency Tuning by One Component 


By making the real part of any one of the three sensitivity functions 
go to zero, it is possible to shift the null frequency accurately over a 
limited frequency range by varying only the one corresponding resistor 
rather than two as would be necessary in general. 

4.1.1 Frequency Tuning with R, 


Here we require that 
Re si?” — 0. (23) 


By inspection of Table I this condition is fulfilled if\ > 1, or R, > R,. 
Then 


Sa i>, (24a) 
iON AY _ abn g 

Sas S50 + ow) ( - 4) eee) 

Si.” x aa + 3) igs 2) (1 + ja). (24c) 


Equation (24c) remains the same as for the general case, which is in- 
dependent of \. However, from equation (22) we find that 


Qw lana — 0. (25) 


Therefore, condition (23) can only be realized at the expense of selec- 
tivity. Incidentally, the potentially symmetrical twin-T for which 
p <1 (see Table II, Part 2b) is a special case of the one treated here, 
namely, that for which a = lI. 

4.1.2 Frequency Tuning with R. 


We require that 
Re 8°” — 0. (26) 


1124 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1970 


This condition can be approached if \ — 0, or R, > R,. We then 
obtain: 


iON WA, awn — g 
Sa 2(1 + a”) ( i) : (27a) 
giew ny en (27b) 
2 
$ ON oo ON. 
Sr, 3 2(1 4 a’) (1 =e jot) . (27c) 


Aside from interchanging the sensitivities with respect to R, and R, , 
these expressions are the same as the preceding ones (equations 24a, b, 
and c). However, there is one important difference, namely in the selec- 
tivity which may now actually be larger than the “symmetrical” value 
of 0.25. From equation (22) we have 


a(l —y) 


Qn Loo as Rin aay (28) 


Depending on the choice of a and », gy can be made to approach its 
maximum value of 0.5. Here again the potentially symmetrical con- 
figuration for which p > 1 is a special case, namely, that for which a = 1 
and v approaches zero in the same manner as \ does. This is one of a 
variety of possible cases for which equation (28) approaches 0.5. Other 
combinations of « and » are best obtained by plotting equation (28) on 
semilog paper as shown in Fig. 7. By setting the derivative of equation 
(28) with respect to a equal to zero one obtains 


Oma = (1 — v)? (29) 
and 


is 
Qmax = Me (30) 


Expression (80) is also shown in Fig. 7 by the dashed curve. Clearly 
there is a wide practical range of twin-T networks, with good-to-excel- 
lent selectivity, that will satisfy condition (26) and thus provide simple 
frequency tuning over a restricted frequency range. 

One of the disadvantages of the twin-T configurations described here 
is that R., the frequency tuning resistor, is “floating,” that is, it does 
not have one of its terminals connected to ground. Thus if it should be 
desired to switch various values of Fz in order to filter or to generate 
different discrete frequencies, hard-contact switches would generally be 
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Fig. 7—Selectivity factor gy as function of a and parameter y, for special case 
that \ > 0. 


necessary since rapid semiconductor switching circuits are difficult to 
design in the floating mode, especially if transformers are to be avoided. 
For this reason the next case is of particular interest. 


4.1.3 Frequency Tuning with Rs 


The required condition here is that 
Re Si? — 0. (31) 


Inspection of Table I shows that this condition cannot be realized 
under any circumstances. However, the following equivalent condition 
can be realized 


Rese < ise (32) 


if a > 1. Referring to Fig. 5, it is clear that this condition can be ob- 
tained in two ways, namely either by letting \ approach zero or unity 
with a medium value for y or simply by letting y become very large. 
However, by inspection of equation (22), both methods result in low 
gy values. Thus, although the tuning resistor has one terminal grounded 
which does have certain advantages in terms of circuit implementation, 
these may be offset by the low selectivity obtainable. 
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4.2 Null-Depth Tuning by One Component 


In some applications it may be desirable to make adjustments in the 
null depth of a twin-T after it has been initially tuned for a perfect 
null. This can be achieved with a single component (for example, 
resistor) if the imaginary part of the sensitivity function with respect to 
that component can be made equal to zero. By considering the general 
sensitivity functions given in Table I and again restricting ourselves to 
variable resistors for practical reasons (and because of RC-duality of 
the twin-T), we obtain the following three cases: 


4.2.1 Null-Depth Tuning with R, 
Here we require that 
T,832" — 0. (33) 


The minimum of the imaginary part of $1” occurs when \ = 1/2” 
in which case the sensitivity functions become 








TON __ AWN Pen i = 2) ~N On 

Pee 2(1 + a’) ( ee ay ie Dae (S48) 
fii awy a, 5 ) ~ 4, ON 

Sr, => 21 a a’) (4 ja wie j 9 ; (34b) 
iON _ QAWn . AG On | 

Sr, = 2(1 + a’) (1 ++ jez) ee Bec i 9 (34¢) 





As shown in the above expressions, condition (33) is realized by (84a) 
if a > 1. Furthermore, the other two sensitivity functions turn out to 
be orthogonal to equation (84a) enabling independent null-frequency 
and null-depth control by two individual resistors (for example, R, and 
R, or R, and Rs). 

It will be remembered that any sensitivity functions requiring large 
values of a were dismissed as impractical in the cases presented in 
Sections 4.1.1 to 4.1.3 due to the resulting decrease in selectivity. This 
was quite realistic since at least the case in Section 4.1.2 could be realized 
accurately while maintaining a free choice in the selectivity constant qv . 
We will see in this section that no such freedom exists in any of the cases 
discussed and that any configuration allowing null-depth tuning by one 
component invariably results in selectivity deterioration. Practical 
implementation will therefore require a compromise between the realiza- 
tion of any one of the sensitivity functions and selectivity. However, 
as will be seen, not all the cases discussed here are equally disadvanta- 
geous with respect to this compromise. 
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Substituting \ = 1/a’ into equation (22), we obtain 
(a? — 1)\Q1 — v) | (35) 


ala” — v) 


dx = 


To obtain as large a value as possible for gy , we let vy — 0 in which case 
dv © 1/a. Thus the more accurately we wish to realize expression (84a), 
the smaller the selectivity will be. 
4.2.2 Null-Depth Tuning with R. 
The requirement here means that 
Lie" 0; (36) 
This would be accurately realizable if we could let 
1 
A=1+ >: (37) 
a 
Due to the restriction given by inequality (20c), this is not possible. 
Instead, equation (37) can be approached approximately by letting: 
1-1 (38a) 
and 
a>. (38b) 


By inspection of equation (19) and Fig. 5, inequality (88b) follows 
directly from condition (38a). More specifically, the imaginary part of 
Si°* has a minimum when 


1 
hel—- (39) 


which is of course realizable. With equation (89), the sensitivity func- 
tions are then 











HONE ee: Oy ae .\ pio Sn 

Sk, — 2(1 at a”) (4, ja) fac | 9? (40a) 
GON __ i em A 3) ~N Wn 

Sz. = “a + a’) (1 a + ae sae Dae” (40b) 

sn" = sq gy t+ ie) | IZ (40c) 





Thus equation (40b) provides the desired en function and also, 
as in the preceding case, the other two functions are orthogonal to it 
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allowing for independent frequency and null depth tuning with two 
individual resistors. 

Substituting equation (39) into equation (22), the selectivity function 
becomes 


spy cee 
ia a(2 — v) 


which is largest with respect to » when v —> 0. Then qy & 1/2a which, 
as in the previous case, is all the smaller the more accurately the desired 
sensitivity function (40b) is to be realized. However, the preceding case 
(namely, that in Section 4.2.1) is preferable if it is simply a question 
of finding a component with which to adjust null depth, since for a 
given value of a the selectivity coefficient is twice as large as here. 


(41) 


4.2.3 Null-Depth Tuning with R, 


Here we require that 
T,8i2" — 0. (42) 


From Table I it is evident that to satisfy this condition a — 0. From 
Fig. 5 we see that a has a minimum when \ = 0.5. Furthermore, since 
a is proportional to y, we can select y < 1 in order for a to approach 
zero. From equation (22) we obtain the selectivity coefficient 


a(l — v) 


w= FLO» (48) 


which is maximum with respect to vy when vy > 0. Therefore from equation 
(43) we obtain qy  a/2. The condition for a, in this case, is the inverse 
of that for the two preceding cases. Taking this into consideration while 
comparing the corresponding selectivity coefficients shows the case in 
Section 4.2.1 to have the highest selectivity. It, like that in 4.2.2, has 
the added advantage of orthogonal sensitivity functions providing for 
both the desired null-depth tuning as well as frequency tuning by single 
components in the vicinity of the null frequency. On the other hand, if 
a variable component with one terminal grounded is preferred for the 
reasons given in Section 4.1.2, then the case in Section 4.2.3 may be 
used, provided the selectivity, which is smaller by a factor of two, is 
still acceptable. 


4.3 Orthogonal Tuning With Two Components 


Orthogonality between two zero sensitivity functions simplifies null 
adjustments in the vicinity of a perfect null, particularly if the two 
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functions are parallel with the real and imaginary axes. Some of the 
configurations described in the previous sections provided the latter 
type of orthogonality, but only at the cost of selectivity. General orthog- 
onality, which is discussed here, may be of interest for a variety of 
reasons, for example, the 90 degree phase reference required for tuning 
purposes may be easier to generate than any other arbitrary phase 
reference. 
Two vectors q = u + juand p = w + jz are orthogonal if 


uw + vz = 0. (44) 
Thus, to obtain orthogonality between pairs of the functions listed in 
Table I, we must investigate if they can be made to satisfy this condition. 
4.3.1 Orthogonal Tuning Between R, and R, 
This requires that 


ie eo (2 + an) E aC 2 »)| 20). (45) 


Solving for the roots of this equation one obtains a,,, = +] which is 
not physically realizable. 


4.3.2 Orthogonal Tuning Between R, and Rs; 


To satisfy the condition for orthogonality here, we require that 


S( ) a(4 + an) = 0. (46) 


Solving equation (46) for @ results in the same nonrealizable roots as 
were obtained in Section 4.3.1. However, one additional solution exists 
here, namely } = 0. This condition can only be approximated [see 
inequality (20c)] and has been dealt with in Sections 4.1.2 and 4.2.1 
where, as expected, Si" is orthogonal to $},°” . 


4.3.3 Orthogonal Tuning Between R. and Rs; 
It is required that 


—a + Ab + eG. = ») | = 0 (47) 


which is satisfied when \ = 1. As in the preceding case, this condition 
can only be approximated; it has been dealt with in Sections 4.1.1 and 
4.2.2. 

It is evident from the above that, apart from the cases of orthogonality 
already discussed in earlier sections, the condition for general orthog- 
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onality given by equation (44) does not produce any new realizable 
twin-T configurations. 


4.4 Design Examples of Twin-T Networks With Controlled Tuning Char- 
acteristics 


The design equations for twin-T networks with the tuning character- 
istics described in Sections 4.1 and 4.2 have been compiled in Table III. 
Results from Section 4.3 have not been included since they did not 


TABLE I—Twin-T DeEsIGN EQUATIONS FOR CONTROLLED 
ZERO SENSITIVITY 


Design Equations for 
Controlled Sensitivity 


Design Equations for 
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(that is, grounded) 
terminal 





produce anything not already obtained in the two previous sections. 
Using the design equations listed in Table III, the detailed procedure 
for the design of two twin-T networks with prescribed tuning char- 


acteristics follows. 


4.4.1 Twin-T With Null Frequency Tunable by Rz 


To satisfy condition (26), we find from Table III, Part 1B, that \ « 1 
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and therefore select \ = 0.01. Furthermore, assuming that qy = 0.25, 
wy = 2rl kHz, and R, = 1 KQ, we find R, = 99 KQ. From Fig. 7 we 
find that the gnax-curve passes through the value 0.25 when a = 0.5. 
However, with \ = 0.01, y takes on a simpler value for a = 0.55 (see 
Fig. 5), namely 
y = a (1 — d) = 0.003. 
Solving equation (28) for » we obtain 
2 
Qa 


y= l—pt 5 = 0% 





and from equation (18) 
Ge = Cs = 0.004C, * 
T’rom equations (5) and (18) 


] 
C, = — ess = :«C0«.292 uF 
> ov(yRiRs)* . 


and 
C, = *¢, = 1.168 nF. 
Finally, with equation (15) 


C, = —— C, = 3.504nF 
l-vp 
and, from equation (6) 


R, = &.R, = 62K. 


The corresponding sensitivity functions can be calculated directly by 
substituting the values obtained above into the expressions listed in 
Table I. Considering only the relative values of the resistor sensitivity 
functions, one obtains 


AWy 


TON __ f= : 
si” = sq 4 a) (0.99 — 1.833), 
GON __ AWyv a. . 
se" = oq 4 a (0.01 — 2.365;), 
sige = — On (1 + 0.553). 


2(1 + a’) 
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The twin-T network resulting from the above calculations, as well as 
the zero displacement vectors given above, are shown in Fig. 8. In Fig. 
9a, measurements of the frequency response of this twin-T are compared 
with those of a symmetrical twin-T nulled at the same frequency. The 
initial frequency response of the two ideally nulled networks are identical 
since both have a selectivity constant gy equal to 0.25. On varying R, , 
however, the null depth of the symmetrical network decreases con- 
siderably with varying null frequency compared to that of the non- 
symmetrical configuration. This is also apparent from Fig. 9b, where 
the percentage frequency shift of the null is shown as a function of the 
relative resistor change R,. Whereas this curve is not appreciably 
different for the two configurations, the simultaneous variation in 
null depth is. 


4.4.2 Twin-T with Null Depth Tunable by R; 


To satisfy condition (42), we find from Table III, Part 2C, that 
a <1 and select a = 0.1. Furthermore, with \ = 0.5 and letting » = 
0.01, for maximum selectivity, we find from equation (22) that gy = 
0.048. 

As in the previous example, we assume that wy = 21 kHz and, be- 
cause \ = 0.5, we select R, = R, = 10 KQ. From equation (19) we find 

= 0.0025 and, in precisely the same way as in the preceding example, 
Cz = 0.318 uF, C, = 79.5 nF, C, = 0.804 nF, and Rk; = 19.8 K. The 
corresponding zero sensitivity functions follow directly from Table I. 
The relative values of the resistor sensitivity functions are 


_ an 


1 + a”) 


ION __ 
Sri _ 


5 (0.5 — 10.053), 
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0.292 4F 


INCREASING R3 
INCREASING Ry 
INCREASING Rp 


Fig. 8—Zero displacement and twin-T configuration for wy = 27:1 kHz, qy = 0.25, 
A = 0.01, » = 0.75, a = 0.55. 


1134 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1970 


_~— FREQUENCY RESPONSE FOR SYMMETRICAL 
0 AND NONSYMMETRICAL TWIN-T 





-20 

Y 

_! 
w 

a 

if “40 

a 

z = $518%--- ON 

z 

= -60 

3S =+25.5% 

~~=- 35.2% 
-80 
— -— SYMMETRICAL 
—-—-— NONSYMMETRICAL 
-100 
0.1 0.2 0.4 06 08 1 2 4 6 8 10 
FREQUENCY IN KHZ 

| cece | 
XN 
Lt 
z 
qe} Zz 
aie 


-20 


—-— SYMMETRICAL 
-40 ——— NONSYMMETRICAL 





~100 -80 -60 -40 -20 0 20 40 60 80 100 





Fig. 9—Comparison of measurements conducted on twin-T shown in Fig. 8 and 
symmetrical twin-T. (a) Null-frequency shift and null-depth variation with variation 
of R2; (b) Twin-T null frequency as function of percentage change in Re. 


fon _ _ AWN < 
82. = +a) Tie 2) (0.5 — 10.05y), 
SS 30 FF aa 5) (1 + 0.19), 


The resulting twin-T network and the zero displacement vectors are 
shown in Fig. 10. Measurements made on the twin-T are shown in Fig. 
lla where they are compared with those of a symmetrical network 
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0.318 
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Fig. 10—Zero displacement and twin-T configuration for wy = 27-1 kHz, qy = 
0.048, \ = 0.5, » = 0.01,a = 0.1 


with the same null frequency. The initial frequency response of the two 
configurations differs here, since the selectivity coefficient of the sym- 
metrical twin-T is 0.25, and that of the other is 0.048. The null depth 
of the symmetrical twin-T can be decreased by more than 50 dB from 
an initial 90-dB null with no measurable change in the null frequency. 
This compares with over 1 percent variation of null frequency for the 
symmetrical configuration. This is shown again in Fig. 11b where the 
null depth variation is plotted versus relative change in the resistor R; . 


V. TWIN-T NULL STABILITY USING THIN FILM COMPONENTS 


The twin-T is frequently used to provide stable zeros in the design 
of hybrid integrated linear active networks. If a high degree of stability 
is required, thin film components must be used for the twin-T network. 
Just what degree of null stability can be expected with thin film com- 
ponents whose temperature coefficients and aging characteristics are 
known, follows directly from the sensitivity functions discussed in the 
previous sections. This is shown in the following. 

A displacement dz in the transmission zero z of a twin-T network can 
be expressed in terms of the zero sensitivity defined by equation (13) 
as follows 


3 
dz = ye se, He pS 86, a, (48) 


As shown in Fig. 12, if the twin-T transmission zero is close to the juw- 
axis it can be considered purely imaginary for purposes of computing 
sensitivities, thus 


Stas Geer. (49) 
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Fig. 11—Comparison of measurements conducted on twin-T shown in Fig. 10 and 
symmetrical twin-T. (a) Null-frequency shift and null-depth variation with variation 
of R3; (b) Twin-T null depth as function of percentage change in Rs. 
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Fig. 12—Typical root locus of feedback network using a twin-T to provide the 
open loop zeros. 


Furthermore, referring again to Fig. 12, the zero displacement is given 
as 


dz = do + djwy . (50) 


Equating (48) and (50) and substituting the expressions of Table I 
in equation (48), it is possible to solve for do and djwy . 

First, however, some characteristics peculiar to thin film integrated 
circuitry must be considered. Due to the batch processing techniques 
used, component uniformity can be guaranteed much more accurately 
than with discrete components. Above all, component variations tend 
to track very closely on a given glass or ceramic substrate and these 
variations can be precisely predicted and controlled. These features 
permit a considerable simplification in the following calculations without 
any loss in accuracy. Thus, we can write 


sales [, te] AT +4, = (51) 
and 
~ = (6, 4¢]AT +x. = a. (52) 


The temperature coefficients of the resistors and capacitors are 6, and 
5. , respectively; «, and e, are the tracking ratios between the three 
resistors and the three capacitors, respectively; AT’ is the temperature 
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range under consideration; and x, and x, are the percentage resistor and 
capacitor aging, respectively. Substituting equations (51) and (52) 
as well as the expressions in Table I into equation (48), we obtain for 
the real part of the finite zero displacement Az = Ac + 7A wy 


Ac _ a 
‘Oy sa 2(1 + a’) [e.(1 


Notice that the zero displacement parallel to the real axis depends 
only on the amount of mistracking between components and not on the 
absolute drift of the individual components. In other words, if all 
components of a kind drift by the same percentage, the null depth of a 
tuned twin-T will not change. This is to be expected since equal com- 
ponent drift corresponds to a frequency scaling process. 

The imaginary part of the finite zero displacement Az is obtained in 
the same way as the real displacement above. Thus 


— v) — «(1 — d)] AT. (53) 


Awyn 


See S(O, es Os) AT each ik 


Wn 
e, AT 14 e@7% cA [a +227], 
mers paar c sae eal ear ero (54) 


Thus, the zero displacement along the jw-axis depends on the actual 
drift of the individual components. Clearly, if the drift coefficients of 
the resistors can be made equal but opposite to those of the capacitors, 
drift along the jw-axis can be practically eliminated. 

In various active filter schemes the network poles are tied closely to 
the transmission zeros generated by a twin-T. Thus, in high Q networks, 
uncontrolled drift of the twin-T zero into the right-half s-plane could 
pull the poles over with it, causing oscillation. Similarly, drift of the 
twin-T zero along the jw-axis would cause frequency drift in the active 
filter. 

To prevent oscillation due to drift into the right half plane, the 
transmission zero of the twin-T' must be located left of the jw-axis by 
some distance oi, Such that, under worst case component drift, it will 
not travel across the jw-axis. Referring to Fig. 12, this implies that 


Omin = Re (AZ ane) = BO vine . (55) 


This condition, in turn, implies that the twin-T null depth may not 
exceed a certain maximum attenuation Ty max Which can now be ecal- 
culated directly. 

It follows from Appendix B that the transfer function of a twin-T 
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with a nonperfect null can be approximated as follows 
s° + Qos + wy ; 


T x(s) ~ i? 
sts +07 
Qn 


(56) 


With equation (55), the maximum null attenuation for left half plane 
transmission zeros is then 


Ty max er = 2 dw Omin = 2 qn AG sas . (57) 
On WN 


With equations (22) and (53) this becomes 





oe eo ee - 
Ty max = ita (7 nat eae ee ) [e(1 — A) + «(1 — v)] AT. 


(58) 


In active filter applications where the twin-T transmission zero 2 
represents the open loop zero of the root locus of a pole p with respect 
to gain (see Fig. 12), the highest attainable Q of the network is all the 
more limited the larger o,;, has to be chosen for stability. In the limit, 
as the loop gain approaches infinity, the closed loop pole p coincides 
with z. The upper limit on Q is therefore given by 


Qmax <5 (59) 


20min 





or, with equation (53) 


(ise) 1 
Qmax < Tap (= yeh 5): (60) 


Thus, with the type of active network design represented by the root 
locus in Fig. 12, both network stability and maximum Q ultimately de- 
pend on the stability of the twin-T network. 

As an example of the above, we shall consider the stability of a sym- 
metrical twin-T network fabricated with tantalum thin film resistors 
and capacitors. The required ambient temperature range is assumed to 
be from 0°C to 60°C. From equation (58) we obtain 


DL Neshas A=v=0.5 — rele, a €.] AT (61) 
a=1 
Typically, for tantalum thin film resistors and capacitors «, = +5 


ppm/°C and e, = +15 ppm/°C. Therefore Ty mx = 7.510°° = —83 
dB. 
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The frequency drift for a symmetrical twin-T results from equation 
(54) as 


A ($8) + 8 POM tte. 6 


@ 

Typically, for tantalum thin film components x, = x, = 0.1% and for 
tantalum thin film capacitors 6, = 200 ppm/°C. The TC of tantalum 
thin film resistors can be controlled by oxygen doping during the sput- 
tering process.’ It may therefore be of interest to solve equation (62) 
for the required TCR, that is, 6,, when a maximum acceptable fre- 
quency drift is specified. Assuming that (Awy/wy)mx < 0.5% we 
obtain 6, = (—215 + 50) ppm/°C. 


VI. CONCLUSIONS 


Design equations have been presented that provide twin-T con- 
figurations with null characteristics that depend on individual com- 
ponent variations in a predictable and controlled manner. This is 
achieved by deriving the zero sensitivity functions with respect to 
each component of the general nulled twin-T. The network parameters 
required to obtain a twin-T that can be tuned by a desired procedure 
and the extent to which such a procedure can at all be realized results 
directly from inspection of the general sensitivity functions. 

Special null tuning procedures are considered that are useful in 
linear active networks incorporating a twin-T in the feedback path. 
One twin-T configuration permits the null frequency to be shifted over 
a limited range by the variation of one component only while the null 
depth remains constant. Another enables the null depth to be varied by 
one component with negligible variation in the null frequency. The pos- 
sibility of independent null frequency and null depth tuning by two 
individual components is also investigated. Orthogonal sensitivity func- 
tions that are parallel to the real and imaginary axis are required to 
do this. It is shown that, apart from the orthogonality that is obtained 
as a by-product in the first two cases, any other or more general kind 
of orthogonality in the sensitivity functions cannot be realized. Design 
examples for the first two cases are given. Measurements conducted 
on the resulting twin-T configurations are presented and compared 
with similar measurements made on a conventional, that is, symmetrical, 
twin-T. This comparison demonstrates the effectiveness of the given 
design equations. 

The stability of the null characteristics of a twin-T with given 
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component characteristics results directly from the sensitivity functions. 
Limits on the permissible null depth of a twin-T are derived for the 
case that left-half plane zeros must be guaranteed under worst case 
component drift conditions. Similarly, expressions are derived that, 
permit the limits on resistor and capacitor temperature coefficients and 
aging characteristics to be established in order not to exceed a given 
maximum frequency drift of the transmission null. A numerical example 
is given using the characteristics of tantalum thin film resistors and 
capacitors. 
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APPENDIX A 


Twin-T Impedance Matrix 


In order to calculate the open circuit impedance matrix of the twin-T, 
it is useful to first obtain its general equivalent z-network. This can 
be simply obtained by converting each of the two T-networks of the 
twin-T into its equivalent z-network. This is shown in Figs. 13a and 13b, 
assuming sinusoidal input signals. The two resulting z-networks can 
then be connected in parallel (as shown in Fig. 18c), and the resulting 
impedance directly calculated. With the two conditions for a perfect 
null given by equations (5) and (6), we get a simple z-network as 
shown in Fig. 14. The corresponding impedances are given by 


zo (042), « 
Pon 

ania) eS) 
i+2 


and 


, (65) 
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1 
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~G Race 
2 





(c) 


Fig. 13—(a) and (b), Conversion of the two T-networks of the twin-T into equiva- 
lent z-networks; (c), Equivalent 7-network of the twin-T. 


where 
n=RC,, 
Tm = RC? , 
7r,=EC,, 


T= R,C3 = i, . 


In terms of these impedances, the open-circuit impedance matrix for 
the twin-T simply follows as 


[z] oo 1 ZAZ» + Z.) ZL i 
Zatds+4.| 77, 24%,+%,) 


So far we have considered a general twin-T that has an infinite null 


(66) 
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Ze 


Fig. 14—General z-network. 


at a specified frequency wy . Because the corresponding general transfer 
function [see equation (10)] has the form of a quadratic fraction, it 
has geometric symmetry around wy . 

The most frequently used twin-T is structurally (and electrically) 
symmetrical. For this case. 


m=? =7T,=7T = RC, (67) 
and 
C= 2. (69) 
The impedances of Fig. 14 then become 
R 1 
a ae 2 (3 a4 4) (70) 
and 
_ op A +87), 
Z, == 2k 1 + 8°7” (71) 


The open-circuit impedance matrix consequently becomes 


1+ 4s7 + $’7 I14+97 

R s7(1 + s7) sr(1 + s7) 
ss 1+.8°7° 1+4s7 +37 

s7(1 + sz) s7(1 + s7) 


@). = (72) 
The voltage transfer function for the symmetrical twin-T then follows as 


2.2 
Ty,(s) = 2 = ee 


Ze ore t+ 4rs t+ 1t (78) 


Comparing with the transfer function given by equation (10), we have 


1144 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1970 


1 
On, = RC ’ (74) 
2ow, = oa (75) 
and the inverse damping factor 
qu, = 0.25. (76) 


It can be shown that the selectivity, that is, the inverse damping 
factor, can be increased by modifying the symmetrical twin-T into a 
potentially symmetrical network. This is possible with any structurally 
symmetrical network for which Bartlett’s bisection theorem holds. 
A symmetrical network can be converted into a potentially symmetrical 
network by impedance scaling one half of the network by some factor p. 
This is shown for the twin-T in Fig. 15. The corresponding z-matrix 
then becomes: 


f 1 
s’r* + oft So Len +1 








MiAIES Rin | cose Asa Te 
p sr(1 + sr) st(1 + sr) 
(2)os = {x -) 
r 14+ s°7? s’r” + 201 + p)sr +1 
sr(1 + s7) sr(1 + s7) 
(77) 
and the voltage transfer function results as 
2.2 
Pie izes = a. (78) 
ne PP + 2(1 + Ler +1 
In terms of the transfer function (10) we find: 
1 
Onvps = RC ’ (79) 
2 (p+1 
20 = SH (e+) 80 
On R p ( ) 
and 
1 
QNos a 9 1 zs o (81) 


p gives a measure of the twin-T symmetry. For the extreme asymmetrical 
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Fig. 15—Potentially symmetrical twin-T resulting from symmetrical twin-T after 
one-half of the twin-T has been impedance-scaled by a factor p. 


case for which p > 1, qy,, takes on its maximum value, namely 


Qn oe ae =e oD (82) 


APPENDIX B 


Twin-T with a Finite Null 


Inspection of equations (8) and (9) shows that the transfer function 
of a general twin-T is simplified by one degree due to the pole-zero 
cancellation on the negative real axis at w, = 1/R3;C, when the conditions 
for a perfect null given by equations (5) and (6) are satisfied. We 
investigate here the conditions necessary to ensure this pole-zero 
cancellation when the null-conditions are only approximately satisfied, 
that is, when the twin-T has a finite null. To do so we derive the sen- 
sitivity of the pole and zero at w, with respect to the six parameters of 
the twin-T and investigate under which conditions the respective pole 
and zero sensitivities are the same. 

Writing the twin-T transfer function in the bilinear form with respect 
to a parameter x we obtain: 


N(s) _ Az(s) + 2B.(8) 








Iw8) = Dey = Ue) + 2V.0) (88) 
The sensitivity of a zero z with respect to x is then given by: 
: _ & —2A,(s) _ __ (s — 2B.) 
aC ae mea (Oa oF 64) 
that of a pole p with respect to x by 
> _ (s — p)U.(8) _ _,, & — p)V2(s) 
BED lag DGS ec sa 
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From equations (84) and (85) we obtain 








2 gah eA 
8; Z2—-Wy 1 + a A.( 1) (86) 
and 
8; are “1 i U,(—a) 
Ita -— 
N 
+ = U(—w) (87) 
G-;h)+e0-74) 
1—)7 —yp 
where 
Wy 
a= wy ’ (88) 
R, 
dr R+h.’ (89) 
C; 
v C0, (90) 
and 
Ov _ a(l — v)(1 — d) (91) 


= Dey FUN +0 


Calculating the respective A,(—w,) and U,(—w,) functions we obtain 
the zero and pole sensitivities listed in Table IV. Comparing the func- 
tions listed in the two columns of the table it is clear from inspection 
that they will be equal when 


AL =») _ 

v1 — 2) : (92) 
With equations (88) through (90), this condition becomes 

RC, = RC. . (93) 


Thus for all twin-T configurations in which the time constants of the 
series elements are the same, pole-zero cancellation on the negative 
real axis is maintained for differentially small perturbations of any 
element of the twin-T. For positive element changes (that is, increasing 
values) the dipole frequency will decrease, that is, move in the direction 
of the origin of the s-plane. Twin-T networks that satisfy equation (93) 
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TaBLE [V—SeEnsrtiviry FuNcrions FoR THE NEGATIVE REAL 
PoLe AND ZERO OF THE Twin-T LocatTED AT—«a, 











Zero Sensitivity Pole Sensitivity 
a a 
Sr; =appgd Sr, ea 
v(1 — A) 
a a 
aa a a a CT a 
v(1 — d) - 
gre = 1 gre 1 
Rs Sa ye Rs sive »v(1 — X) 
“dd — 2) 
en 1—»p ae (1 — »r) 
Se, aye Se, oa dea leEN) 
7 A(1 — v) 
hea? fox v ~o1 v 
So, = 1 1 + ine Sc, @) f 4 ; v(1 = d) 
“x — ») 
a a 
x1 — a») * ® 





include all symmetrical configurations in which the series elements are 
identical as well as potentially symmetrical configurations in which 
the series elements are characterized by relations of the type 


Rk, = ak,, 
GC, = C,/a. 


(94) 


APPENDIX C 


Twin-T Zero Sensitivity 
Expressing the numerator NV(s) of the twin-T transfer function 
N(s) = A.(s) + 2B.(s) (95) 


the null return difference F3(s) with respect to x is given by 


1148 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1970 


Ns) B.S). 
A,(s) A,(s)_ 
With equations (1) and (2), the null return difference of the twin-T 
with respect to its six components can be calculated directly. 

To obtain the null return difference of the nulled twin-T, equation 
(8) can be substituted into equation (96), namely 


Fis) = =1l+2 (96) 


F%s) = (s cpale fe a (97) 


where, w, = 1/R,C, = 1/R,C;. The corresponding zero sensitivity 
then results as 


fon __ s$- Jon rae (s = Jwy)o,0yA,(s) 
oe, AE ho eae aay Weise. 
which simplifies to 
rec SI) Aces (99) 


2(1 + a’) 


where a = w;/wy. The individual A,(jwy) functions follow directly 
from equations (1) and (2). Substituting these into equation (99), 
the zero sensitivity of the nulled twin-T with respect to its six com- 
ponents is obtained. These are listed in Table I. 
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Thermal and Electrical Properties of Coated 
Conductive Substrates for Integrated 
Circuit Chip Mounting 


By H. G. MATTES 
(Manuscript received February 10, 1970) 


Presently used substrates for integrated circuit chip mounting and 
interconnection provide limited heat sink capability and preclude the 
effective use of matched wmpedance transmission line interconnection. 
These characteristics can be improved by the use of very narrow microstrip 
lines on thin dielectric layers backed by thermally and electrically con- 
ductive material. We propose a model for calculating the effective thermal 
resistance of layered substrates and make calculations for a variety of 
materials. The thermal problem ts considered for heat extraction either 
through the edge or through the face of the substrate. Improvements in sub- 
strate thermal conductance of an order of magnitude appear realizable 
compared to presently used substrates (for example, 0.625 mm alumina). 
We analyze the electrical parameters of narrow (0.02-0.1 mm) microstrip 
lines on coated substrates in the frequency range 10 MHz to 10 GHz. The 
characteristic ampedance, propagation delay time, and attenuation are 
found to be frequency dependent, and efforts to minimize this frequency 
dependence by magnetic loading result in greater, though more constant, 
delay times. Losses may be significant (~2-5 dB/cm in many cases), 
but short line lengths due to dense circuit packing allowed by improved 
heat disstpating capability will minimize this disadvantage. 


I. INTRODUCTION 


A necessary component in realizing the potential of integrated circuit 
technology is the substrate upon which the integrated circuit chip is 
mounted. The substrate must meet several requirements to avoid 
deterioration of the system’s performance: 


(z) The substrate must act as a rigid, reliable mounting platform for 
the chip. 
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(iz) The substrate must allow the dissipation of heat produced on 
the chip without causing an increase in temperature which would detri- 
mentally effect circuit operation. 

(i277) The substrate must permit electrical connections to and inter- 
connections between chips in a manner compatible with desired circuit 
performance. 

(wv) The substrate fabrication and chip mounting must be realizable 
at a cost consistent with the cost requirements of the system. 


Presently used substrates (for example, 0.625 mm thick Al,O; ceramic 
wafers) are limited in their attainment of objectives 7 and iz 
outlined above. Heat sink capability is limited by the low thermal con- 
ductivity and relatively large thickness of material necessary for mechan- 
ical rigidity and by the difficulty in extracting heat from the substrate 
itself. High frequency (>500 MHz) performance of beam-leaded in- 
tegrated circuits suffers from the transmission line mismatch necessary 
with conventional substrates as is discussed below. 

A new materials configuration, as shown in Fig. 1, is envisioned which 
would simultaneously satisfy the four basic substrate requirements 
listed above. A thin dielectric layer (0.001 mm to 0.05 mm thick) 
deposited or grown on a massive electrically and thermally conductive 
ground plane could allow efficient heat dissipation, effective electrical 
interconnection and provide a rigid, inexpensive bonding plane. It is 
the purpose of this paper to calculate the thermal and electrical prop- 
erties of the coated substrate configuration suggested here. 
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Fig. 1—Schematic drawing of layered substrate, 
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II. THERMAL PROPERTIES 


The model used to represent the proposed substrate for calculation of 
thermal properties is shown in Fig. 2. All constants may be arbitrarily 
set, and the heat flow may be subjected to either of two distinct bound- 
ary conditions: the sides of the disk may be held fixed at a constant 
temperature and the top and bottom faces insulated, or the bottom face 
may be held at a constant temperature and the sides and top insulated. 
These cases closely approximate two heat sinking configurations fre- 
quently found in practice. The cylindrical geometry was chosen for 
ease of calculation (the square or rectangular problem, although alge- 
braically simpler, results in a double series which presents formidable 
convergence problems when one is interested in a numerical result). The 
specific geometry should have little effect on the calculated values, and 
any reasonable geometry (held constant throughout the calculations) 
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Fig. 2—Model used in calculations of thermal properties. 
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should provide a reliable comparison between alternate substrates. 
The heat input was assumed uniform over a circle of radius A, although 
other heat input distributions might have been chosen. As in the choice 
of geometry, this should have a minimal effect on the results. (Currently 
used beam leaded chips distribute heat over an area larger than the 
silicon dimensions. It can be assumed that the entire chip-beam lead 
structure thus forms a relatively uniform heat source on a substrate.’) 


2.1 Algebraic Solution 


The thermal problem has already been solved for a homogeneous 
cylinder.’ In the case of a layered cylinder, it is necessary to solve La- 
place’s equation simultaneously in both the upper and lower regions 
and match boundary conditions at the interface. In the calculations 
described in this paper, no discrete thermal resistance was inserted 
at the interface so the matching of boundary conditions reduced to 


Oy 2 ge OU: 


K 02 02 


= Kk. 





where K, , T;, , K., J’. are the thermal conductivities and temperatures 
in regions 1 (upper) and 2 (lower). The insertion of a thermal resistance 
at the interface is trivial but was approximated as zero due to the inti- 
mate interface contact obtained by the method of materials preparation 
envisioned for these substrates. 

Laplace’s equation was solved for the two sets of boundary conditions 
(sides grounded and face grounded) by the method of separation of 
variables. In cylindrical coordinates, V°7' = 0 reduces to 


T = Wo + Wz 
+S Ga nlA nd m(7) + BAVA] moh 


[exp (+£6,2)] (2) 


where boundary conditions exist to impose eigenfunction solutions in the 
r direction, and J,, and N,, are Bessel functions of the first and second 
kind respectively. The boundary condition on the grounded surfaces was 
given as T’ = 0 while 07'/d (normal) was set equal to zero on the insulated 
surfaces. On the top surface (z = 0) the boundary condition was written: 


oT jeu 0<r<A, 2z=0; 


i aa Ki 


0 A<r<B, z2=0. 


(3) 
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f is the value of heat input in (watts/unit area). This boundary con- 
dition was met by using the orthogonality properties of the Bessel 
functions in a Fourier-Bessel series as outlined in Morse and Feshbach 
and other texts.° 

For the case in which the sides are grounded at T = 0, the tempera- 
ture distribution is found to be 


apa (0 '5)4-(0-5) 


T= ds BK, ——Pe) E cosh (82 ) + c, sinh (2 )| (4) 


where 8, is the nth zero of Jo(r) and 
a tanh? (etfs) _ (A a 1) tanh (Bes ye) tanh (? ATS eH) a a) —1 


AW, Ky 1 2. : 
eon (e0 )f — Be — tanh (2 av.) ce (2h AW, + W. \\ 4% Ae tanh (2 E: Ww) 














for O<z2<W,; 


(5) 


tanh? (#12) —1 
lee a TAF 2 Ks seen (2:1 W, y age (6: Ws + t¥a)| 4% Ke tanh (2 BAW + W.)) , 


for Wi<z2<WitW,; 





1, for 0<z2<W,; 


fie ue + 2) [1 er 4) 


fee ee | 
WV K : : 
tanh (2:0. y E ae (2x7 ) eae (200 + Wa)) + toon (5x10. + W2)) 


, (6) 





fo Wi<z2<W, +. 


For the case in which the bottom face is grounded at T = 0, the 
temperature distribution is found to be 


pat 
T — B? f(d, + d.Z) 


wap ML ftp suaalle 





n on K J o(Gn) 
where a, 1s the nth zero of J,(r) and 
d, = i : ; (8) 
Wit WwW, 


K, Wi<z2<W,+W, 
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I 
i 0<2<W, 
d, = : ? (9) 
] 
K, W, < Zz < W, + W, 
tanh (atts + )) — tanh ats [ts nh cua) tanh (=o(tl + Wa)) _ 1| + i} 
» (al; K» an an(W, + W2)) _ Ky, : 
tanh ( B ) + (% - 1) tanh ( B ) tanh ( B K, 
ake for 0<z2<W;; (10) 
nV + Wa) 2 fa,W, 
tanh (c(t = Wa?) [1 — tanh ( )] 
B B 
tanh’ (ass) + (&: _ 1) tanh (<stfs) tanh (2 R W2)) — 5 
for Wi<z< Wit W; 
1 fo 0<z2<W, 
tanh? (ast?) —1 
(11) 





dy = ? 
2 a) (@ - 1) (=f) (22 + ¥a)) _ Kp 
tanh’ (sth B + K tanh B tanh = a K, 

for Wi<2<W,+W,.. 


A quantitative measure of the power dissipating capabilities of a 
substrate is given by the thermal resistance 


R (degrees c) = AT(°C) _ Prax (12) 
therm \ watt q (watts input) ~—- + A”f 
Tmax Clearly occurs at z = 0, r = 0 s0 
A 
2 1 I:(6, 5) 


Rinerm (side grounded) = TKA > a TG) 
Ko anh? (ee). (Ke > 1) ta ae (6th : Ps) baat (6h = ws) id 








: Ky B Ky 
aal)| ey (22%) (2 nm Wa) | K, (uz + 2) (13) 

tanh (20s B 1 K, tanh B tanh 2 + K, tanh R 

and 
In(e.5) 
_ WiK2 + WK, 2 tee 
Reverm (face grounded) — 7B’K,K, TK,A Sa J7(a,) 
tanh (a Ed) _ tanh eae [ts nh (2-H) tanh (s(n + Wad) ~ 1| + iS ( 
F 1 : 14 








2 (adVy Kz _ any a(t ra) _ kK, 
tanh ( B ) + (& 1) ta nh (a ‘) tanh ( B K, 
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2.2 Computer Program 


A ForTRAN IV computer program was written for an IBM 360-50 to 
evaluate Renerm-sidae ANd Rinerm-tace. Lhe first forty values of alpha 
and beta (the zeros of J; and Jj) are inserted in the program from the 
tabulated values of Watson.* The values of the higher zeros are obtained 
in the program by analysis of the asymptotic approximations for J, 
and Jy. 

Since the expressions for Rin.r:m take the form of infinite series, no 
completely precise evaluation can be made. Figure 3 shows the ap- 
proximation to Rtnerm-face given by truncating the summation after 
varying numbers of terms. Plotting data similar to Figure 3 for a range 
of all input parameters (A, B, K, , K. , W, , W.) disclosed several trends 
in the convergence of the series. The “‘period” of the oscillation was 
inversely correlated to the ratio A/B. This is mathematically consistent 
with the use of a Fourier-Bessel series to describe the top face of the 
disk. For ratios of A/B less than 0.02, the first maxima in the expression 
for Rinerm-side 1S Not reached until partial sums with greater than 300 
terms are evaluated. The amplitude of the oscillations in the function 
Rinerm (nUMber of terms) was positively correlated to K./K, and W,/W.. 
The damping of the oscillation (per cycle) appeared to be relatively 
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Fig. 3—Partial sums approximating pice for A = 0.0625, B = 3.75, K, = 0.20, 
Ke = 2.05, Wi = 0.0005, W2 = 0.0625 
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independent of input data. These variabilities in the behavior of Rinerm 
made determination of a precise value difficult. Additionally, only the 
first 314 terms of the series could be evaluated. The operation of the 
program was terminated at this point by a floating point overflow which 
occurred in the IBM-SSP subroutine used to generate the Bessel func- 
tions. Since the largest arguments of the Bessel functions are intrinsic 
to the problem, and are not functions of the input parameters, nothing 
could be done to circumvent this difficulty. It thus became necessary 
to evaluate Rinerm on the basis of a truncated series of 314 terms of less. 

A graphical outline of the method used to extract an approximation to 
the asymptotic value of Rinerm from a finite number of terms in shown 
in Fig. 4. The first two local maxima, M, and M;, and the first local 
minimum, /,, were recorded; and the averages, A; and A, , were cal- 
culated where 


ies (15) 
i (16) 








Ny Ne N3 


Fig. 4Pictorial of extremum averaging technique used for improving series 
convergence. 


COATED CONDUCTIVE SUBSTRATES 1159 


In Appendix A, a theorem is stated and proved to show that the asymp- 
totic value of Rinerm 1s less than A, and greater than A,. An average of 
the two, (A, + A,)/2, was used as a most probable value of Rinerm . 

As a test of the accuracy of the derivation of Rinerms-side 200 Riherm-tace 
and the computer programs for generating these values, K. was set 
equal to K, (equivalent to a uniform, single layer cylinder) and values 
of Rinerm Computed for W, = 0.625 mm, W. = 0; W, = 0.875 mm, 
W. = 0.25 mm; and W, = 0, W. = 0.625 mm. The three cases were 
in complete mutual agreement, and agreed to the accuracy given with 
the published results of D. P. Kennedy for heat conduction in a homo- 
geneous, isotropic cylinder.” 


2.3 Numerical Evaluation 


Several materials combinations were chosen for evaluation as possible 
substrates. Some of the considerations for inclusion were compatibility 
with integrated circuit techniques (for example, silicon), smooth surfaces 
for microdeposition of conductor paths (for example, polished silicon, 
cold-rolled aluminum), low dielectric constant (for example, SiON), 
and thermal expansion coefficient. The materials considered and the 
associated values of K used with them are shown in Table I. A = 0.625 
mm and B = 3.75 cm were chosen as representative of typical inte- 
grated circuit chips and substrate sizes and were used in most of the 
calculations. 

The results of these calculations for a variety of layered substrates 
are shown in Figs. 5 through 10. With high conductivity dielectrics 
(for example, Al,O;) the thermal resistivity of the ground plane con- 
tributes significantly to the total thermal resistance, but resistances as 
low as that of beryllia are attainable for a considerable range of dielectric 
thicknesses. Low conductivity, glassy (for example, SiON, SiO.) dielec- 
trics, can result in thermal resistances equal to or greater than that of 
alumina if glassy layers of greater than 0.03 mm thick are used. 


TABLE I—THERMAL CONDUCTIVITY OF SUBSTRATE MATERIALS 











Material K (watts/em °C) 
.05 


Al,O; (deposited) 
Al,O3 (sintered) 
BeO 


Si 
SisN, 
SiOz 
SiON 


coooorconw 
19 6) 
oo 
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Fig. 5—Aluminum oxide on 0.25 mm aluminum. 


For comparison, thermal resistances of presently available homogene- 
ous substrates were calculated and are given in Table II. 

Figure 11 shows the effect of substrate radius on Rinerm-siae (the 
effect on Rinerm-tace WAS <2% for B = 0.5 cm). In the range shown, 
heat is presumably flowing radially at r = B so increases in B add to 
Rinerm at a rate proportional to 1/B. Consequently, a “critical radius” 
exists above which Pinerm-side 1S relatively constant. 

Radiative and convective heat losses will be negligible for the high 
conductivity layered substrates discussed in this paper. A lumped value 
of convective and radiation coefficient of 0.003 watts/em.” — °C 
results in an equivalent parallel thermal resistance of greater than 50°C/ 
watt for all cases considered here. 


2.4 Thermal Results 


Layered substrates consisting of dielectric layers backed by thermally 
conducting ground planes can provide substantial improvements in heat 
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sink capability over alumina ceramic substrates. Ceramic substrates 
are commonly used in 0.625 mm thickness to provide needed strengths. 
Metal ground planes could be reduced to 0.25 mm thickness while 
maintaining physical strength greater than 0.625 mm ceramic. For 
situations where heat sinking the back of the substrate is possible, 
Al,O3; on 0.25 mm aluminum can provide up to twice the heat sink 
capability of 0.625 mm beryllia. Metal substrates have the additional 
advantage of convenient, low thermal barrier heat sink mounting or 
even direct incorporation into the heat sink structure (for example, 
heat pipes, finned substrates). 

Although the above values of Rinerm are significant, as given, for 
relative comparison of substrate materials and geometries, their ab- 
solute significance can only be assessed in relation to other parameters 
intrinsic to an integrated circuit heat dissipating system. V. E. Holt 
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@ SIDE GROUNDED 
A FACE GROUNDED 


O,4 0,625MM BERYLLIA 
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Fig. 6—Aluminum oxide on 0,625 mm aluminum, 
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Fig. 7—Silicon oxynitride on 0.25 mm aluminum, 


has found thermal barriers of 32°C/watt for 1.25 mm chips conducting 
heat to a substrate by means of beam leads and a resin bonding layer.’ 
This value is reduced to 3 to 5°C/watt by AuSi eutectic bonding. 


Imperfections in the heat sink will have to be assessed in each situation 
encountered. 


II. ELECTRICAL PROPERTIES 


3.1 Microstrip Transmission Lines 


Electrical interconnections on the proposed substrates would best be 
made in the form of standard microstrip transmission lines.” They can 
provide transmission line interconnects necessary for high speed pulse 
and microwave electronics while maintaining low values of crosstalk 
between adjacent lines and still provide de and low frequency paths 
for other circuit requirements. The primary incompatibility of presently 
used substrates and beam-leaded integrated circuit chips is due to the 
impedance mismatch between the narrow (0.02 to 0.05 mm) beam leads 
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and the physically large transmission line (typically 502 implies 0.625 
mm line width on 0.625 mm alumina).’ The removal of this discontinuity 
would require narrow conductors deposited on a dielectric layer of 
appropriate thickness to maintain the desired transmission line im- 
pedance. The physical configuration contemplated for small microstrip 
lines is outlined in Fig. 12. To minimize the impedance discontinuity 
at the interconnection with beam leaded chips, W is restricted to 0.1 
mm maximum. To minimize crosstalk, this would suggest t; S 0.05 mm’. 
These figures, coupled with the electrical parameters of dielectric materi- 
als sufficiently smooth to allow the reliable deposition of 0.02 mm 
conducting paths, are consistent with the desire to develop lines with a 
characteristic impedance of 502, the most commonly used interface 
impedance for circuitry operating above 500 MHz. Under certain con- 
ditions, as seen below, it is desirable to raise the series inductance of the 
transmission line. For this reason, provision has been made for placing a 
thin layer of high permeability material between the conducting strip 
and the dielectric. In practice the adhesion metal of a sandwiched inter- 
connect (dielectric—adhesion metal—gold) might serve this purpose. 
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Fig. 8—Silicon oxynitride on 0.625 mm aluminum. 
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Fig. 9—Silicon dioxide on 0.25 mm silicon. 


3.2 Transmission Line Parameters 


Although many transmission modes are necessarily present in an 
unsymmetrical transmission line such as an unshielded microstrip, the 
TEM mode will be considered of dominant interest in the frequency 
range considered here and the line parameters will be calculated for 
this mode. Several authors (for example, Assadourian and Rimai,’ and 
H. A. Wheeler®) have derived expressions for the characteristic imped- 
ance and other pertinent parameters for microstrip transmission lines. 
Although these formulae have been generally verified in their range of 
approximation’’’’’° the physical scale of lines contemplated in the last 
section requires the inclusion of several terms which had been ignored 
in the previous derivations. 

The characteristic impedance, Z,, and propagation constant, v, of 
any power transmission system are given by” 


— (R + jol 3 
figs (f + ish) : aa 
y = [RB + joL)\(G + joC)}. (18) 
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Ignoring losses (R, G — 0), the propagation delay time can then be 
written 


r= Ht = (Loy. (19) 


The inductance of the line is given by two components: the inductance 
due to magnetic field energy storage in the strip conductor and the 
ground plane (the internal inductance); and the inductance due to mag- 
netic field energy storage between the strip conductor and the ground 
plane (the external inductance). The internal inductance of the strip 
conductor can be derived from Ramo and Whinnery” and is given by 


sia OES 2h) 
a. Pun (24) sin (2. 


Wow) 6; (24) (24) 
2 cosh — cos ra 


6, 
where p, is the resistivity of the strip conductor material and 6, is the 








(20) 
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Fig. 10—Silicon dioxide on 0.625 mm silicon. 
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TABLE JI—THERMAL RESISTANCE OF CONVENTIONAL SUBSTRATES 


Rtherm-side Riherm-face 
(degrees C/watt) 

Alumina 

0.25 mm 103 6.8 

0.625 mm 46 12 
Beryllia 

0.25 mm 15.3 1.02 

0.625 mm 6.8 1.8 


skin depth of the strip conductor at the frequency, w, of interest. [6 = 
(2p/wu)*]. For the frequency range considered in this paper (f > 10 
MHz) current spreading in the ground plane can be ignored for lines as 
narrow as 0.02 mm if the ground material has a resistivity <107° 
ohm-cm. For higher resistivity materials this approximation is less 
valid, especially at the lower frequencies, but the approximation will 
be made for convenience. A more rigorous analysis in the higher resis- 
tivity case would give increased internal inductance and increased 
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Fig. 11—Thermal resistance with side grounded A = 0.625 mm W, = 0.0125 
mm, W, =.0.625 mm. 
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Fig. 12—Schematic drawing of microstrip transmission lines on layered substrate. 


crosstalk through magnetic fields generated by spreading currents in 
the ground plane. The internal inductance per unit length of line due 
to fields in the ground plane can then be given by a formula similar to 


(20) 
: 2t, 2. (2E 
oer (2) — sin (2) 


1 
Wa | 6, (24) (24) 
cosh oe ae cos 3 


L, = (21) 


where p, and 6, are the resistivity and skin depth of the ground plane. 
For the dimensional range of interest (W > 2t,) the external inductance 
per unit length is approximately given by English and McNichol” as 


Lest = Fy (ta + Hate) (22) 
where pe is the relative permeability of the magnetic material. Total 
Lis given by the sum 

L=L, +L, + Lest. (23) 


The total capacitance per unit length between the strip conductor and 
the ground plane is the sum of four components 


C= Cr C, + Cr + C,. (24) 
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C’, is the capacitance per unit length between the top of the strip and 
the ground at infinity™*. 


Gist + In 2. (25) 


C, is the capacitance per unit length, neglecting fringing, between the 
face of the strip and the ground plane. 


C; = €0€R La (26) 
tg 
where eg is the relative dielectric constant of the insulating layer. The 


additional capacitance per unit length due to the fringing fields at the 
edges of the strip is given by Joines’® 
7 
me 


Tv 
2ta 





¢, = M88 [1+ (1 - exp 


For the case where the strip conductor is not of negligible thickness, an 
additional contribution to the capacitance is due the capacitance be- 
tween the edges of the strip and the ground plane. The expression for 
this term, derived in Appendix B, is 


2 
¢, = 2ate ay {Best Bh Ble ble BAT (28) 
d 


Losses on these microstrip lines will be considered as a perturbation 
in the calculation of the other parameters although this approximation is 
quite poor in several of the cases considered below. Losses in the dielectric 
will be ignored, being much less than the conductor losses for the geo- 
metries discussed in this paper. Under considerations similar to those 
for deriving the internal inductance, the equivalent series resistance is 
given by™?'” 


sinh (24s) + sin (24) sinh (24) + gin (24) 
Ps 8 ea Do g g : (29) 


a cosh (24) — cos (24) ms cosh (24) — cos (4) 
5/ ~ > iM 


The attenuation per unit length of line is given by”* 








R= 


a 





R 
a= 97 (nepers). (30) 


Although this expression is rigorously valid only for | R/w | « | L|, it 
will be used in all computations made in this paper. In the case where 
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| R/w | = | L|, the line will have a significant reactive component in its 
characteristic impedance. Equation (80) for a will then contain a con- 
tribution for phase shift as well as attenuation. In this paper | Zp |, 
Re (Z,), and Re (a) will be calculated. 

The equations for crosstalk between adjacent, parallel strip lines 
derived by Kordos® apply directly to microstrip lines of the dimensions 
considered here if the characteristic impedance is calculated by the 
means outlined above. In these cases, Kordos derives an expression for 
the near end crosstalk of properly terminated parallel lines of length, 1. 


Vols) 30 In [1 + (44/WYE 
Vi): Re Zod(en)" 


V,.(s) and V,(s) are the Laplace transforms of the near-end crosstalk 
voltage and the input signal respectively. The far-end crosstalk is 
approximately zero. This formula is only valid for ¢,, = 0. For t,, > 0, 
Ve would be further reduced by the “keeper” effect of the magnetic 
material on the magnetic field of the driven line. 


[1 — exp (—2srl)]. (31) 


3.3 Numerical Evaluation 


The formulas given above have been applied to layered substrates of 
the type discussed earlier. The calculations assumed that the strip con- 
ductors (Au — 2.5 X 10°°Q—cm.) should exhibit a resistance of 
<0.0040/sq. to give a de resistance <1.50/em for the narrowest line 
considered, 0.02 mm. This results in t, = 0.00625 mm. Silicon and 
aluminum were considered as possible ground plane materials. The 
maximum doping level in silicon which allows the preservation of a 
smooth surface after processing is approximately 2 X 10° atoms/em® 
which results in a bulk resistivity of approximately 2.5 X 10°Q—em. 
The constant parameters used in the calculations are shown in Table ITI. 

The expressions given in Section 3.2 were evaluated for a spectrum 
of frequencies between 10 MHz and 10 GHz for several physically 
realizable dielectric-ground plane materials combinations with varying 


TasBLeE ITJ—E.uecrricaL PARAMETERS FOR STRIPLINE 
CALCULATIONS 








(All dimensions in cm) 


ps (Au) = 2.5 X 10-6 t, = 6.25 X 107 

pg (Al) = 2.5 X 1075 eg (sintered Al,O3) = 9.6 
pg (Si) = 2.5 X 1073 €z (deposited Al,O;) = 9.0 
LR os nas é€z (SION) = 3.8 

ts = 6. 





5 x 10-4 ep (SiOz) = 3.6 


1170 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1970 


thicknesses of magnetic material. A representative sampling of the 
results is shown in Figs. 13 through 20. These data are not meant to be 
comprehensive or complete but merely to represent the effects and 
trends observed in the calculations. 

Figures 13 through 16 give Re (Z,) and 7 as a function of dielectric 
thickness and/or line width. The effect of the large internal inductance of 
Si on Zp and 7 is especially apparent for thin dielectrics (where the ex- 
ternal inductance is small). It is noted in Figs. 13 and 14 that 7 passes 
through a local minimum at ¢; ~ 0.01 mm mils. Below this value (for 
aluminum ground planes) the constant internal inductance is greater 
than the external inductance; and the capacitance, which decreases 
with t, , causes 7 to decrease with t,. Above the minimum, as the in- 
creasing external inductance predominates, the fringing capacitance 
does not decrease linearly with t, so 7 rises. As tz increases further, the 
equations used in this paper cease to be valid and the inductance will 
not increase linearly with ¢, . 

In Figs. 17 and 18, the solid lines show the frequency dependence of 
the characteristic impedance. The variation of Re (Z,) with is particu- 
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Fig. 13—Impedance and delay time at 1 GHz for aluminum oxide dielectric on 
aluminum ground plane. 
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Fig. 14—Impedance and delay time at 1 GHz for silicon oxynitride dielectric on 
aluminum ground plane. 


larly large in the case of Si since the frequency independent external 
inductance is small compared to the internal inductance. For pulse 
applications in which an impedance match is desirable over a broad 
frequency range, the situation may be improved somewhat by increasing 
the frequency independent component of the inductance. This may be 
done by loading the line with magnetic material, the results of which 
are shown by the dashed lines in Figs. 17 and 18. The open triangles 
show the unavoidably greater propagation delay times associated with 
the loaded lines. It can be seen from the deriving formulas that only 
the product uzt, was considered significant, so materials of other 
permeabilities and appropriate thicknesses would be equally effective. 

The losses associated. with these lines have been calculated, and a sam- 
ple of the results is shown in Fig. 19. Attenuation (dB/cm) is plotted for 
two materials configurations at each of two frequencies for a range of 
substrate thicknesses. The losses are particularly great for silicon ground 
planes (0.00250-cm) and for small dielectric thicknesses (which result 
in low values of Z,). The losses decrease rapidly as line width increases. 
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The near-end crosstalk for semi-infinite lines is shown in Fig. 20 
as calculated without the influence of the magnetic material. For all 
geometries considered in this paper the near-end crosstalk is below 
4 percent and is typically 1 percent or less. The presence of magnetic 
material would reduce this further. 

Tor comparison, the transmission parameters of a 500 microstrip 
transmission line on a conventional 0.625 mm thick Alumina substrate 


are’’*® given in Table IV. 


3.4 Llectrical Results 


Effective transmission line interconnections for chip to chip on a 
layered integrated circuit substrate appear realizable. The major dis- 
advantages are variation of impedance with frequency, appreciable 
delay times (and variation of delay time with frequency causing pulse 
distortion) and significant losses. These problems can be minimized by 
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Fig. 15—Impedance and delay time at 1 GHz for silicon dioxide dielectric on 
silicon ground plane. 
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Fig. 16—Impedance and delay time at 1 GHz for 0.0125 mm thick silicon dioxide 
dielectric on silicon ground plane. 


careful design (for example, judicious choices to ¢,, and up). The problems 
of delay times and attenuation (characteristics directly proportional 
to the length of the line) will also be reduced by the much greater chip 
packing densities allowable with substrates of high thermal conduc- 
tivity. Nonetheless, the high losses encountered with Si ground planes 
(along with its lower thermal conductivity) make this material a dubious 
substrate choice* except for possible special cases in which wide lines 
could be used for all portions of the circuitry where attenuation need 
be considered and where the ground returns for high current lines 
(for example, power supply lines) could be deposited as surface metal- 
lization. 

* This configuration (0.0025 Q-cm Si as a ground plane) should not be confused 
with the possible use of high resistivity Si (1500 Q-cm) as a dielectric with gold 
surface metallization and a metal ground plane. T. M. Hyltin has measured the 
dielectric loss in microstrips with high resistivity Si dielectrics.!* Such a dielectric, 
when used in the configuration discussed in this paper, would contribute an addi- 


tional 0.5 dB/cm attenuation to the attenuation calculated above for aluminum 
ground planes. 
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Fig. 17—Impedance and delay time for silicon oxynitride dielectric on aluminum 
ground plane. 


In the case of aluminum ground planes, the losses on narrow (2 mil) 
lines are probably tolerable for present circuitry with the shortened lines 
due to higher packing. For longer lines, power leads, and so on, it may 
be necessary to use wider lines and connect to the beam leads of the 
integrated circuit chips through tapered section “‘transformers’’. The 
impedance transition need not be as great as the desired width change 
would indicate if ¢,, could be increased under the wide lines to compensate 
for the decreased LD and increased C. 

Many present applications for lower speed integrated circuit logic 
do not require matched transmission lines for all or any of the inter- 
connections. The primary electrical requirement is then the completion 
of interconnects between integrated circuit chips with a minimum 
capacitance to ground. For such applications the narrow conductors 
envisioned in this paper would nearly compensate for the increased 
capacitance per unit length due to reduced ¢; compared with conven- 
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tional 0.625 mm alumina substrates. Use of SiON dielectric (eg = 3.4) 
would additionally improve this situation. Resistive signal loss in this 
application should be negligible since typical input impedances are 
1-2K ohms. The possibility of shortened lead length due to higher 
packing densities on high thermal conductivity substrates then suggests 
the possibility of improved electrical performance. 


IV. CONCLUSIONS 


The advantages to be gained from a layered substrate are substantial 
from thermal considerations. The modifications of the electrical prop- 
erties resulting from the layered geometry do not appear to present a 
significant obstacle to the development of such substrates. Indeed 
many improvements in electrical performance appear realizable with 
thin line metallization and transmission line interconnections. Layered 
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ee 18—Impedance and delay time for silicon dioxide dielectric on silicon ground 
plane. 
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Fig. 19—Attenuation at 100 MHz and 1 GHz. 


substrates with massive metal ground planes will adequately satisfy 
rigid mounting plane requirements, and the physical configuration of 
unbalanced microstrip lines allows the use of reliable, inexpensive 
thermocompression bonding of a beam-leaded integrated circuit chip 
to the substrate. 
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APPENDIX A 


Averaging Technique for Improved Series Conversion 


Theorem: Let T(N) be a function of varying sign with zero crossings 
enumerated Zo, 21", 2: Zis1)” where 1 ts chosen such that 

















dT 
aN sess. > 0. (32) 
If 
Zita Zite Zits 
2 TO) | > > T(N) | > => T(N) : (33) 
Then 
DTW) + DTN) « > TN) + 2 TW) 
N=0 ; N=0 < p> T(N) < N=0 ; N=0 ; (34) 
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Say end crosstalk between adjacent, parallel 500 microstrip lines 
tm = 0. 
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TABLE [TV—5002 SrrRiIPLines on 0.625 mm ALUMINA SUBSTRATES 


Line width 0.625 mm 
Delay time 0.05 nsec/em 
Attenuation 0.048 db/em 
Crosstalk 6.75% 


Proof: (See Fig. 21) 


Write 
Zi 
> TW) = S; (35) 
N=0 
from inequality (33), 
t+1 4+3 
pe T(N) > De T(N) (36) 
SO 
Sia1 —_ S; > Ssi3 = S425 (37) 





Leo Stitt 
wok +3 





Fig. 21—Enumeration of extrema and zero crossings. 
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from inequality (32) 


[S8;— Sia) > | Sine — Sise | (38) 
SO 
|S; — Sere] > | Siar — Seas |; (39) 
similarly 
| Sis1 — Sins] > | Sis2 — Sire | (40) 
SO 
|S; — Siso| > | Sint — Sirs | > | Sire — Sire]. (41) 
Write 
S. = W. (42) 


Inequality (41) holds for any 7 chosen in agreement with inequality 
(32) asi — o. Then 


[S;-Wl>|8u—-W[>|S8iu2—W| (43) 
define 
_ 8s + Sins 
oan cone 


_ 


A, (44) 


but from inequality (48) 
|S; aa. W | > [sedi —W| 
and from inequality (82) 


S; < Seat 
so 
WSs > Si — W, (45) 

giving 

S,+ Sis. < 2W (46) 
so 

2 

From inequality (44) 

Ayet S41 + Sya2 (48) 
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but from inequality (48) 
[Sar —-W|>|Sua—W | 
and from inequality (32) 


Sia S Sigs 
SO 
Siar — W > We Sasa, (49) 
giving 
Siar + Sino > 2W (50) 
sO 
Ain > = W (51) 


from inequality (47) and (51) 
A: <W< Ain (52) 


and the theorem is proved. 


APPENDIX B 


Edge-ground Capacitance for Microstrip 


The capacitance from the edges of the finite thickness strip above 
the infinite ground plane is equivalent to the capacitance problem repre- 
sented in Fig. (22a). This is equivalent to twice the capacitance repre- 
sented in Fig. (22b). The conformal transformation W = sin™*(z/t,) 
results in the configuration of Fig. (22c). The capacitance per unit length 
of this configuration can be calculated by equations (26) and (27) of 
the text. This results in 


Conv SE | cosh (1 + i) 
T ty 


+ In (1 + {1 — exp [—cosh”’ (1 + w/in) | 





For t, = ta; 


cosh™* (1 +4) win (2424). 
ta ba 
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Fig. 22—Conformal mappings. 


Then 


m fren Jy (2ta + 2b," f ( + 2) 
ox ett tn ( ty )+mlit De 
_ &€2 {2 + 2, + [2a + tte + zal, 
T ba 


The edge-ground capacitance is then 


c, = Ree yy {Blut Bh + 2 + tN + 2107, 
vin 


ta 
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The Field Singularity at the Edge of an 


Electrode on a Semiconductor Surface 


By J. A. LEWIS and E. WASSERSTROM* 
(Manuscript received February 18, 1970) 


Near the edge of a charged electrode on the surface of a semiconductor, 
the field in the semiconductor may become very large because of the accumu- 
lation of charge at the electrode edge. Such large local fields are undesirable, 
not only because they may cause local breakdown, but also because they 
make the behavior of a semiconductor device difficult to predict. 

In the present paper we consider a simple mathematical model of an 
electrode edge-semiconductor-insulator configuration and derive conditions 
under which large local fields may be avoided. More accurately, since the 
electrode edge and semiconductor corner angles are assumed to be perfectly 
sharp, we derwe conditions under which the local field in the semiconductor 
is nonsingular. It 1s necessary to include the effect of the surrounding 
insulator, because even for small insulator-semiconductor dielectric constant 
ratios, a field singularity in the insulator will be coupled back into the 
semiconductor. 


I. INTRODUCTION 


Beneath a charged electrode situated on the surface of a semicon- 
ductor, at points far from the electrode edge, the electrostatic field is 
regular and quasi-one-dimensional, with its maximum value at the 
electrode. Near the electrode edge, however, the field may become very 
large because of the accumulation of surface charge at the sharply curved 
electrode edge.’ Also, the jump in dielectric constant between the semi- 
conductor and the surrounding insulating material may produce a 
large local field intensity. Such a field may be so large as to cause ava- 
lanche breakdown near the edge, but, in any case, the presence of such 
an edge effect makes the behavior of a semiconductor device difficult 
to predict. 


* On leave from the Technion-Israel Institute of Technology, Haifa, Israel, when 
this work was performed. 
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In this paper we consider a simple mathematical model of an electrode 
edge-semiconductor-insulator configuration, namely a sharp-edged elec- 
trode on top of a semiconductor mesa, as in Fig. 1. We study the be- 
havior of the potential, or rather its singular part, in the two wedge- 
shaped semiconductor and insulator regions shown in the inset circle 
of Fig. 1, assuming that the potential is locally planar and that its 
singular part satisfies Laplace’s equation, both in the insulator and in 
the semiconductor. Since the treatment is local and the electrode edge 
and mesa corner are replaced by mathematically sharp wedges, our 
analysis can only predict the existence or nonexistence of a singular 
field at the edge and cannot produce an estimate of local field strength, 
which depends on conditions far from the edge. 

We derive an estimate of the order of the singularity in the potential 
of the form ¢ = O(r’), where r is the distance from the corner and 
p > 0. The local field is thus of order r”~’, singular for p < 1. We con- 
sider p as a function of the semiconductor wedge angle a, the electrode 
wedge angle 8, and the insulator-semiconductor permittivity ratio 
n, for a and 8 between zero and 180° and 7 between zero and one. 

We find that, to avoid a field singularity, we must make @ greater 
than 90° and @ less than 90°. In particular, if we take 8 = 180°, any 
a less than 90° yields a nonsingular field. Such a configuration might 
be realized, for example, by using an overhanging electrode on an under- 
cut semiconductor mesa, as in Fig. 2. The length of the overhang must 
be several Debye lengths for small electrode potential and several 
depletion layer thicknesses for large reverse bias, in order that the 
present theory be applicable. 

Figure 3 summarizes our principal results. It gives the range of a, 
for 90° = B S 180°, 0 S n S 1, within which the field is nonsingular. 
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Fig. 1—Mesa with sharp-edged electrode. 
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Fig. 2—Overhanging electrode on undercut mesa. 


It should be noted that our results apply without modification to edge 
fields in capacitors and, with the appropriate interpretation, to steady 
temperature fields in conductors. In the latter case, it would be interest- 
ing to study the corresponding thermal stresses, as, for example, in a 
glass-to-metal seal. 


II. A MODEL OF THE ELECTRODE EDGE 


We consider the electric field in a current-free semiconductor, near 
an electrode edge whose cross-section is sketched in Fig. 4. The sketch 
shows a typical semiconductor mesa with corner angle a, surrounded 
by insulating material, and supporting an electrode with corner angle 
G. For given insulator-semiconductor permittivity ratio 7 = €/e, (0.1 
for air-silicon, 0.3 for silica-silicon), we wish to choose e and B to avoid 
local field singularities. 

In the semiconductor the dimensionless potential ¢ satisfies an equa- 
tion of the general form 


V’¢e = fe), (1) 


where the Laplacian operator is made dimensionless with the Debye 
length for small potential and by the depletion layer thickness for large 
reverse bias (see, for example, Ref. 2). “Large” or “‘small’’ distance then 
means large or small with respect to one of these typical lengths. In 
particular, we shall assume that the electrode is so large in a direction 
perpendicular to the cross-section that the local field may be treated 
as planar. 

We seek solutions of equation (1) which have field singularities at 
the vertex r = 0, that is, a bounded potential » such that | V¢| is 
unbounded at r = 0. In the neighborhood of such a singularity the in- 
dividual second derivatives which make up the Laplacian will be very 
large, although they must combine to make the Laplacian equal to the 
bounded function f(g). As far as the singular part of the solution is 
concerned then, the specific form of f(y) is unimportant and we can in 
fact set f(¢) equal to zero. The singular solution then satisfies Laplace’s 
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Fig. 3—Bounds on semiconductor angle a as a function of electrode angle 8 for a 
nonsingular field. 


equation 
V*o = &e/dr’? + dg/r dr + A e/r? 00? = 0, (2) 


in the neighborhood of r = 0, both in the semiconductor (0 < @ < a) 
and in the insulator (a < @ < 2a — 8). At the electrode faces we have 
the boundary conditions 


g(r, 0) = g(r, 2r — B) = 1, (3) 


while at the semiconductor-insulator interface, in the absence of surface 
charge, we have the continuity conditions* 
* As we shall see, no matter how small 7 is, equation (5) couples any singularity 


in the insulator back into the semiconductor. Satisfaction of this condition is an 
essential feature of the problem. 
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Fig. 4—Semiconductor-electrode edge configuration. 


g(r, a7) = g(r, a"), (4) 
dv(r. a)/80 = ndg(r, a*)/d0. (5) 
We now expand ¢ in positive powers of 7, setting 
f + 3 A,r” sin p,6, for OS 6S a, 
g(r, 6) = ae 
[1+ > By" sin pie — B— 8), for aS dS 27-68, 


satisfying equation (2) and boundary condition (3). The A’s, B’s, and 
p’s are chosen to satisfy the continuity conditions (4) and (5), which 
take the form 


A snpa — Bsinp(2Qr — B—a) = 0, 
A cos pa + 7B cos p(Q2r — B — a) = O. 


These equations have a nontrivial solution only if the coefficient deter- 
minant vanishes, giving the characteristic equation for p 


n Sin pa cos p(2r — B — a) + cos pa sin p(Qr — B — a) = 0. (6) 


We wish to find the smallest positive value of p which will satisfy this 
equation, as a function of a, 8, and 7. For this value of p we have 


¢ — 1 = 00”), 
| Ve | = 00’), 
singular for p < 1, in the neighborhood of r = 0. 
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Ill. THE FLAT SURFACE AND THE MESA 


Before treating the general problem, let us consider two special cases, 
in order to gain insight into the behavior of p as a function of a, B, and 
n. In the very common case of a thin electrode (8 = 0) on a flat semi- 
conductor surface (a = 7), as in Fig. 5, the characteristic equation (6) 
reduces to 


sin pr cos pr = 0, (7) 


for all 7. It is satisfied by p = 1, p = 3, the latter being the smallest 
value of p. In this case, near the electrode edge the field is singular, like 
r?, no matter what insulating material is used. This is the same singu- 
larity as that obtained in the classical Weber problem of the disk elec- 
trode. The same local behavior was also found in Ref. 3, where a closed- 
form solution of the above problem was derived for the linearized 
semiconductor equation and 7 = 0, 7 = 1. 

Now let us attempt to reduce the singularity by cutting away the 
semiconductor, placing the electrode on top of a mesa, as shown in Fig. 
6 for a = 7/2. For 7 = 0 this configuration gives a one-dimensional, 
regular field, normal to the electrode, so that we might expect that it 
would be advantageous for small 7. With 8 = 0, a = 7/2, equation (6) 
becomes 

pain pone rs ce 0 (8) 

2 2 2 2 ; 

satisfied by p = 1, for all ». However, note that, for 7 = 0, it also has 
the smaller root p = 3. If we examine the equations for A and B, we 
find that, in this case, A = 0, so that the root p = 2 gives a singularity 
only in the insulator for 7 = 0. However, a perturbation for small posi- 
tive 7 shows that A is of order 7, so that the singularity is coupled back 
into the semiconductor for any positive 7, no matter how small. 

On the other hand, for 7 = 1, the smallest root of equation (8) is 
p = 3. In this case p is independent of a, for 7 = 1 corresponds to a 
single dielectric filling the whole space around the electrode. Now, if 


SEMICONDUCTOR 





Fig. 5—Flat semiconductor surface. 
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Fig. 6—Right-angled mesa corner. 


p is a continuous function of 7, its value for0 < 7 < 1 must lie between 
the value for 7 = 0 and the value for » = 1; that is, we must have 


=p <4, 


for 8 = 0,a = 7/2, and0 < 7 < 1. Forsmall n, the singularity is weak- 
ened in some sense, but not removed, by the formation of a mesa. 

These two special cases indicate that the calculation of p = p(a, B, ») 
is not completely straightforward. They also suggest that the simple 
cases 7 = 0, 7 = 1 can be used as a framework for the general calcula- 
tion. 


IV. LIMITING VALUES OF PERMITTIVITY RATIO 
Let us first consider the case 7 = 1. In this case equation (6) becomes 
sin p(2x — 8) = 0, 
independent of a, as one would expect. Its smallest positive root is 


p = p(a, 8,1) = pi(8) = «/(2r — 8), (9) 


singular for 8 < a. Those familiar with potential theory will recognize 
this as the singularity at the tip of a wedge-shaped electrode, protruding 
into a uniform dielectric. It has been studied in detail by Wasow, 
Lehman, and Joyce.*° 

The other limiting case y = 0 gives two roots. Equation (6) becomes 


cos pa sin p(2Qr — B — a) = 0,7 
with the roots 
p = p(a, B, 0) = po(a) = 1/2a, (10) 
p = pla, 8, 0) = po@, 8) = x/(2nr — B — a). (11) 


The first of these roots gives a singular field in the semiconductor for 
a > 7/2; the second gives a singular field for a + 8 < a (in the insulator 
only, for 7 = 0) but weakly coupled back into the semiconductor for 
small positive 7. 
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For 0 < 7 < 1, p must lie between p, and the smaller of the two 
values p>, , p4,. Now py > p> fora < (22 — B)/3, so that p lies between 
pi and p;,for0 S a S (2m — 8B)/3, and between p, and pj, for (27 — 
8)/3 Sa Ss x. Also py < pi, fora > (24 — 8B)/2, so that we finally 
obtain the series of bounds listed below: 








7 T 27 — B 
Pag NO Pee ae a for 0<a<—3— ? (12) 
7 T 2r — B 2r — B 
Onerg hae! for a <a<- 3 , (18) 
is 1 2r — B 
on <P <a, 8! for 3g Sat, (14) 


Now the largest value of the upper bound is attained at the point where 
the upper bound of equation (12), an ascending hyperbola as a function 
of a, meets the upper bound of equation (13), a descending hyperbola, 
that is at the point a = (27 — 8)/3, where p = 37/2(2r — B) = Daax- 
For 8 < «/2 this maximum is less than unity, so that a field singularity 
can be avoided only by choosing the electrode angle 6 greater than 90°. 
Similarly, the upper bound of equation (13) implies that the semi- 
conductor angle a must be chosen less than 90° to avoid a field singu- 
larity. 

A particularly simple way of satisfying these requirements is the 
combination of overhanging electrode (8 = 7) and slightly undercut 
semiconductor mesa (a S 7/2) shown in Fig. 7. In order that the theory 
be applicable, the length of the overhang must be several characteristic 
lengths, i.e. several Debye lengths for small electrode potential and 
several depletion layer thicknesses for large electrode potential. The 
mesa corner needs to be undercut only enough to be certain that @ is 





Fig. 7—Overhanging electrode on undercut mesa, 
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never greater than 90°, within fabrication tolerances. In Section V we 
document these preliminaries. 


V. ARBITRARY PERMITTIVITY RATIO 


For detailed calculations it is convenient to rewrite the characteristic 
equation (6) in the form 


(1 + ») sin pr — 6) + (1 — 4) sin pQ@r — B — 2a) = 0 (18) 
from which it is a simple matter to calculate the derivative 


Op _ 2(1—n)p cos p(2r—B— 2a) 
da (1+n)(2r—8) cos p(2r—8)-+ (1 — 9) (247 —B— 2a) cos p(24—B—2a) 


(16) 


Now, whereas it is difficult to solve equation (6) directly for p as a 
function of a, 8, 7, because of the existence of neighboring higher roots, 
it is a simple matter to integrate equation (16), starting from a = 0, 
where p = 2/(2xr — 8) for all y. This calculation was carried out for 
n = 0, 0.1, 0.3, 0.5, 0.7, 1.0 and for 8 = 0, 1/2, 37/4, x. The results are 
shown in Figs. 8 through 11, for the two lowest branches of p.* As 
Section IV predicts, a nonsingular field (p = 1) becomes possible only 
for 8 2 7/2, with the range of permissible values of a and 7, increasing 
from a = 7/2, n = 0, at B = 7/2, tOo0 < a < 2/2,0 < y <1, at 
B=. 

This range of values of a for given 6 and 7 is easily found. If we set 
p = 1 in equation (15), we find that it is satisfied by two values of a, 
which bound the permissible range for a nonsingular field. Figure 3 
shows the result of this elementary calculation. For example, for 8 = 
37/4, » = 0.1, the field is singular for a < 0.2917 = 52.5°, regular 
for 0.2917 < a < 0.4597 = 82.5°, and singular again for larger a. The 
slanting dashed line gives the minimum value of 8, and the correspond- 
ing value of a, for which the field is nonsingular for given 7. 
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Fig. 8—p(a) for 8 = 0 and various 7. 





Fig. 9—p(a) for 8 = 7/2 and various 7. 
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Fig. 10—p(a) for 8 = 37/4 and various 7. 
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Fig. 11—p(a) for 8 = a and various 7. 
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Determination of the Shape of the Human 
Vocal Tract from Acoustical 
Measurements 


By B. GOPINATH and M. M. SONDHI 
(Manuscript received January 14, 1970) 


In this paper we describe methods for determining the cross-sectional area 
function of the human vocal tract from acoustical measurements made at one 
end. The pressure and volume velocity are assumed to obey Webster’s horn 
equation, which is valid for frequencies below 3.5 kHz. Acoustical 
properties below 3.5 kHz do not uniquely specify the area function. This 
paper shows how high frequency information may be incorporated into 
the mathematical model in a manner consistent with a priori information 
about the vocal tract. Some results of application of the methods by computer 
simulation are presented. It is interesting to see from the figures that nine 
numbers (namely, length, four formants, and four residues) specify the 
area function quite well for practical purposes. 


I. INTRODUCTION 


In recent years there has been considerable interest in the modelling 
of speech production in terms of the motion of the articulators. This 
interest has stimulated work on the determination of the shape of the 
human vocal tract as a function of the utterance. For frequencies less 
than 3500 Hz, wave motion in the vocal tract is essentially planar, so 
that the shape is effectively specified by the cross-sectional area as a 
function of distance from one end of the tract (say from the glottis). 

During the past two decades X-ray techniques have been used to 
determine these area functions. These techniques suffer from two major 
drawbacks: (7) In order to keep the exposure to X-rays within safe 
dosage limits, only a small number of measurements can be made on 
any one subject; (22) The interpretation of X-ray data is a complex 
and difficult art, and a number of assumptions must be made in order to 
convert this data to area functions. The accuracy with which area func- 
tions are reconstructed is rather limited. 
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In 1965, Mermelstein and Schroeder suggested the new approach 
of inferring the area functions from acoustic information.’ Under the 
usual assumptions of lossless plane wave propagation, they showed that 
if the area function A(x) of a vocal tract of length J is of the form 

log A(z) = log Ay + >> a, oo (1) 
n=l 
then in the limit as a, — 0 for all n, the nth eigenfrequency (with the 
tract closed at x = 0 and open at x = 1), is given by 


he as Non(1 = 5A2n-1) (2) 
where Xo, is the nth eigenfrequency of the uniform tract (a, = 0,7 = 
1, --:). Likewise, to the same approximation the nth eigenfrequency 
with the tract closed at both ends, is given by 

Ln = Mon(1 aa 4 don): (3) 


Using equation (2) to obtain a2,.; from w, , Mermelstein and Schroeder 
obtained antisymmetric approximations to area functions from a knowl- 
edge of formant frequencies alone. This work was extended by Schroeder” 
and Mermelstein® to include the even-order coefficients [using measured 
values of the poles and zeroes of the input admittance at the lips, which 
correspond respectively to the )’s and p’s in equations (2) and (3)]. 
An extension was also made by Mermelstein® who devised an iterative 
algorithm to compute the first 2m coefficients in equation (1) from a 
knowledge of (A; *++ Am, M1 ‘** Hm), When the perturbations of the 
eigenvalues are too large for first order perturbation theory to be ac- 
curate. Another iterative scheme has been obtained by J. Heinz, by 
applying perturbation theory to tracts of arbitrary shape.* 

These methods are applications of very general techniques (namely, 
perturbation theory and steepest descents) which do not make use of 
the special characteristics of the problem at hand. They also leave 
unanswered certain mathematical questions such as the convergence 
of the iterative procedures and uniqueness of the solution. 

In this paper, we describe two (noniterative) methods for computing 
the area function from acoustical data. Apart from clarifying the 
physical and mathematical aspects of the problem, these methods 
provide solutions in a form suitable for analyzing the sensitivity of 
the reconstructed area functions to inaccuracies of the data. They 
also enable us to answer such basic questions as: ‘‘What tube has all 
but a finite number of eigenvalues identical to those of a uniform 
tube?” 
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In Section II we introduce the wave equation and Webster’s horn 
equation and list the basic properties of the solutions and eigenfunctions 
of the horn equation under homogeneous boundary conditions. 

In Section III we present a method for computing the area function, 
based upon the factorization of the kernel of an integral operator which 
transforms solutions of the horn equation to the solutions of the equa- 
tion for a uniform tract. The existence of such a transform was proved 
by Marchenko,’ and the transform has been used in the solution of 
the inverse-Sturm Liouville problem by Gelfand and Levitan.° 

In Section IV we present an alternative method for computing the 
area function based upon the solution of an integral equation whose 
kernel is the driving point response to an impulse at one end of the 
tract. This integral equation was introduced without derivation by 
Krein in a paper on an application of his theory of extensions of positive 
definite kernels.’'* Our derivation is physically motivated and uses only 
elementary theory of forced motion of a second order system. 

In Section V we present preliminary results of an application of 
our methods to the determination of vocal tract shapes and a comparison 
with X-ray derived data. Figures 3 through 9 show the results of these 
computations. 


Ii. MATHEMATICAL PRELIMINARIES 


For a tube of variable cross-sectional area A(x) the equations relat- 
ing acoustical pressure p and volume velocity V are 


Op p OV 





ee A) OF (4a) 
aV _ _ A(z) ap 
oz ——t—éipstl’”s«éCE' ’ (4b) 


under the assumption of lossless plane wave propagation in the tube. 
These assumptions are accurate for the vocal tract for frequencies 
up to about 4 kHz. For convenience we will choose units such that 
the velocity of sound c = 1, the density of air p = 1 and the length 
of the tube is z. Then elimination of V in equations (4a) and (4b) 
gives 


a Op _ op = 
oe A(z) aa A(2) ae OS2S7; (5) 


and for sinusoidal time dependence, such that p = (zx, \)e™‘, the 
function (2, \) satisfies 
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IIA 


2 4@) 22% 4 ya@e@e,)=0 OSesr ©) 
which is Webster’s horn equation. Throughout this paper it will be 
assumed that A(z) > 0 (0 S$ x S x), A(O) = 1, and that A(z) has 
continuous first and second derivatives except at a finite number of 
points in [0, z]. At the points of discontinuity A(x) and its first two 
derivatives are assumed to have finite right and left limits. Under these 
conditions on A(x) the following lemma holds. 


Lemma 1: The solution of equation (6) satisfying the initial conditions 


¢0,)=1, 9/0,) =0 (7) 
exists, and 
| [A@Pb(@, 0) — [Aola)}¥(e, ») | = ©) 
where 


0S K <a, OS 2S 1, 


and (x, d) is the solution of equation (6) with the same initial conditions 
and A(x) replaced by a canonical shape A,(x). The function A(x) ts 
such that A,(0) = A(O) = 1, Ao(x) ts constant everywhere in [0, a] except 
at points of discontinuity of A(x) where it jumps by the same factor as 
does A(x). 


The proof of this lemma is given in Appendix A. 

The solutions of equation (6) satisfying the initial conditions (7) 
become eigenfunctions if they satisfy some homogeneous boundary 
condition at x = zw. These eigenfunctions and eigenvalues have well 
known properties which for the specific case ¢(z, 4) = 0 we summarize 
in the following lemma. 


Lemma 2: If A(x) satisfies the conditions described above, then there 
exists a sequence d,; (the eigenvalues) satisfying 


(2) \, > 0,A; 9 ~ ast Oo; 
(21) (x, ;) are solutions of equation (6) satisfying the initial conditions 
(7) and the condition o(m, \;) = 0; (8a) 


Git) A@ $e, DBC, 99 dea 


I 
- 


tj, 


= Ai y t= J, (9) 
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with 
0:< al < @; 


(wv) (a, ,) are complete in the space L,(0, 7) of square integrable 
functions. 


An immediate consequence of Lemmas 1 and 2 is the following corol- 
lary. 


Corollary 2.1: If u, is the sequence of eigenvalues for the canonical tube 
A,(x), with the conditions atx = 0 and x = was in Lemmas 1 and 2, then 


d= we + 0(4) (10) 
and the of; of Lemma 2 satisfy 


a= [ Av, n) dx + 0(4) 


atl 
=i + o(4) 


where y is as defined in Lemma 1. 


(11) 


Finally we will require the following lemma. 

Lemma 8: There exists a function H(a, t) such that 
[Ao(a)?¥(w, 4) = [A(@)]}’o(@, ») + [ H(x, )[A@YVe(t, \) dt. (12) 
This can be proved by substituting equation (12) into equation (6). 


After trivial, but involved, algebra (see Appendix B) it turns out that 
for (12) to be true, H(2, t) must satisfy the following: 


PH (x,t) PH(x, ) , (AOVY” = 
H(z, a) = —5 [at (A@HY bo = 8 
(LADIV HG, 9 leoo — BO] = 0. (15) 


The theory of partial differential equations guarantees the existence 
of a solution to equation (13) under the boundary conditions (14) 
and (15). 
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III. DERIVATION OF A(x) FROM THE SPECTRAL FUNCTION OR TWO SETS 
OF EIGENVALUES 


The spectral function is defined as a staircase function of \ with jumps 
of af at \; (¢ = 1, ---). Thus to say that the spectral function is known 
is equivalent to saying that the pairs (A; , af) 7 = 1, --- are known. 
Appendix C shows that if \; (¢ = 1, ---) are the eigenvalues of the same 
tube for the conditions ¢’(0, 4) = 0, 6(0, A) = 1, ad(a, A) + b¢’ (7, A) = 0, 
(b ¥ 0), then a knowledge of the pairs (\;, 4), 7 = 1, --- specifies 
the spectral function. Also, in Section IV it will turn out [see equation 
(27), with x = 0] that \,; is the zth pole of the driving point impedance, 
and 1/20; the corresponding residue. 

We now derive A(x), given the spectral function. In cases where A (x) 
has continuous first and second derivatives the spectral function 
suffices to uniquely determine A(x). In cases where A(z) has a finite 
number of discontinuities, the locations and magnitudes of the jumps 
are also assumed to be known. Note that equation (12) may be written 
in symbolic form as 


[Ao(a)¥(e, ) = (I + Hota, )[IA@)? (16) 


where IJ is the identity on (L"[0, r]) and H the integral operator such 
that 


g = Hi oe) = [ He, 1 a. 7) 


Define the operator U which takes a square summable sequence of 
real numbers f;, 7 = 1, 2, --- to a square integrable function f(z) 
[on (0, )] defined as 


fla) = FAM POC, 09/a5 (18) 


Define the adjoint operator U* which takes a square integrable function 
f(x) to a square summable sequence f; given by 


j=l tee, TA@Pde i= 1+. (19) 


Let R and R* be defined analogously to U and U*, with [A (z)]*6(z, ,) 
replaced by ¥(z, \,;)(A.)* in equations (18) and (19). 
Then 


k= (2+H)U 
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and 
RR* = (I+ H)UU*W + AD*. (20) 


However, from the completeness and orthogonality of the [A (x) ]’6(z, »,), 
it follows that UU* = I. Thus 


RR* = (I + HY) + B)*. (21) 
Note that 


*[Ao(é)Ao(a)} dé (22) 


(because [Ao(x)]*p(z, ,)/y, is an orthonormal and complete sequence). 

From the asymptotic formulae of Lemma 1, it is seen that the kernel 
of the operator RR* — I is of the Hilbert-Schmidt type [that is, square 
integrable on the square (0 S$ x, ¢t S$ =).] Therefore, if the \; , a; cor- 
respond to those of a tube with appropriate boundary conditions then 
the factorization of equation (21) is always possible. 

This essentially completes our derivation. For the kernel of RR* 
can be constructed if the \,’s and a,’s are known [and in the case of 
discontinuous A(x), the positions and magnitudes of the jumps are 
also known]. The factorization (21) then gives H(a, t). Finally, since 
¢(z, 0) = land Y(z, 0) = 1 (0 S x Sn), equation (12) gives 


A@E+ | He, A@H at = [oP (23) 


which can be solved for [A (z)]*. 

Although, in general, the factorization of equation (18) is difficult, 
we will show in Section V an effective method of computation when 
all but a finite number of ),;’s and a,’s are identical to the corresponding 
p,’8 and y,’s. 


IV. DERIVATION OF THE AREA FUNCTION FROM THE INPUT IMPEDANCE 


In this section, for simplicity, the area function and its first two 
derivatives will be assumed continuous. Consider the forced pressure 
response y(x, t) in the tube, due to a unit ramp r(t) of volume velocity 
at x = 0. This may be obtained by including a term 6(x)r(é) on the right 
hand side of equation (4b). The resulting equation for y(z, ¢) is 


m A(a) Me t) me A(z) Tne = — 8(z)u(t) (24) 
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where u(é) is the unit step. Integrating equation (24) over x from 0 
to a gives 








A(a) 4) = — ae 2 -| A) Za, !) dx = ~1 4 = 0. (25) 
OL |2=a OX |z=0 0 

In this equation we have put u(é) d’y/at? = Z(z, 1); it is the transfer 

impedance (in the time domain) from the input end to the point z. 

However (dy/dx) = O at x = O because of the boundary condition. 

Also, since the velocity of sound has been normalized to unity, for 

t S a the region of the tube beyond x = a is undisturbed. Thus for 


t < a, A(z) dy/dx = O for x 2 a. Thus equation (25) becomes 


IA | 


i AGIG@- Dies t Oi=% 


IIA 


a. (26) 


By expansion in terms of ¢(a, \;) it can be verified that 


2 2) 
Ze, 1) = ae t) _ Ss $(x, d,) £08 Ast 1>0 (27) 
i=1 ay 
where the convergence is assumed to be in the sense of distributions, 
and ¢; , a; , A; are as defined in Section IT. 
Let f(t) be a function such that 


[i@z@,oa=1 asa. (28) 
0 
Then by substitution into equation (26) it follows that 
[a= [ A@ ar. (29) 
0 0 


The interesting duality in equations (26), (28), and (29) enables deter- 
mination of A(x) in terms of Z(0, ¢) rather than Z(z, t). Multiplying 
equation (28) by A(x)Z(z, s), integrating over x and changing order 
of integration on the left-hand side we get 


i “4() di i Ale) Z(a, )Z(a, 8) de = i " A(a)Z(a, 8s) dx i <a. (30) 


For s < a, the right-hand side equals unity by virtue of equation (27). 
On the left-hand side the integration limits on x can be changed to 
(0, +) since Z(x, t) = 0 for x > t. Then substituting for Z(x, t), Z(a, s) 
from equation (27) and using the orthogonality equation (9) we get 


i f(t) dy SEA GOS it Ah oo Mere a Ble (31) 
0 ' 
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[Equation (81) may also be obtained by multiplying equation (28) 
by [A(x)]*? and using the linear transformation of Section III.] Defining 
fit) = f(|t|), |¢| S a we note that f(®) satisfies 


/ 40 ss COS A;S COS A;E = sin \;8 sin A;# apa is| Sa (32) 


Qa; 


since sin \,t is odd. From an elementary trigonometric identity it then 
follows that 


IIA 
g 


[ JO7Oeoearet. . % (33) 


and, from equation (29) 
| ade = / - AG) ae: (34) 


Thus, if Z(0, ¢) (which is the driving point impedance function at « = 0) 
is known, or measured, then solution of equation (33) for each a gives 
the area function. [Note that to get A(x) for x S a, Z(Q, t) is required 
for t S 2a, as expected from physical considerations. |* 

We close this section by noting that although we have discussed 
the method in terms of measurements made at « = 0, where the bound- 
ary condition corresponds to a closed end, trivial modifications are 
needed if measurements are to be made at an open end. In the latter 
case since the pressure vanishes, dv/dx = 0. Therefore, the same method 
is applicable to the horn equation for volume velocity, with a measure- 
ment of driving point admittance (instead of impedance). The driving 
point impedance (admittance) may, of course, be evaluated from 
measurements at any end with an arbitrary, known, termination. 


V. APPLICATION OF THE METHODS TO DETERMINING VOCAL TRACT AREA 
FUNCTIONS 


As noted in the introduction, the one dimensional Webster’s horn 
equation is an accurate description of wave propagation in the vocal 
tract, only for frequencies less than about 3.5 kHz. Hence the 2, of 
Section II have no physical counterpart whenever they exceed 3.5. 
We therefore start with the \; and a; (2 = 1, --- n) as measured data, 
and assume that for 7 > n, the d,; and a; are identical with those of some 
canonical tube. In view of the asymptotic formulae of Section II, 
this assumption is reasonable. 


* For another aac of equation (33) see Appendix D where it is further 
shown that f(a) = [A(a)}”? 
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We know of no a priort method for choosing the canonical shape so 
as to give the best match between the computed area functions and 
those of the actual vocal tract. For simplicity one might assume the 
canonical tube to be uniform. However the experimental area functions 
published by Fant? all show a sharp discontinuity at the epiglottis, 
which suggests choosing a canonical tube with such a discontinuity. 
We have tried both these canonical shapes. 

Once a canonical shape has been chosen, e; , A; , (x, A;) and W(a, ui), 
4 = 1, --- m may be computed. Under the assumptions of this section, 
the factorization of equation (21) can then be carried out in the following 
manner. 

We use the vector notation 


k(x) = [W(a, d.)/on ++» WOR, An)/Orn 
W(x, ws)/M1 ts WE, Mn) /Vn] 

keo'(x) = [h(a aa)/ar , ++» W(@, And/or 
— V(x, w/t WX, Me)/Va] 


where k,(x) and k,(x) are n-dimensional column vectors and the super- 
script 7’ denotes transposition. Then the kernel of equation (22) be- 
comes k7(x)ka(t), and it is easily seen that H(z, t) has the form 


H(a, t) = ki(z)h(d) x>t (35) 

where h(t) is some vector function of t. Then equation (18) becomes 
t 

ki(x)ko(t) = ky (a)h(d) + kr(@) | i h(c)h"(c) ae fiat (36) 
0 

as long as \; ¥ w;,7 = 1,--- n. fA; = uw; for some ?z, a slight modifica- 


tion is necessary.) However, from the linear independence of the com- 
ponents of k,(x) it follows that 


t 
Ik.(2) = h(t) + | [ h(o)h"(c) de fs. (37) 
Equation (87) can be solved for h(t) by the analog circuit shown in 


Fig. 1, or by an equivalent computer simulation. Also, since equation 
(23) now becomes 


[AGP + Ki) [ HOLA@P at = [4ol0)? (38) 


the analog circuit of Fig. 2 yields [A(z)]*. 
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k5 + h 


T 
k, hh 


Fig. 1—Analog computer circuit for computing h. 


The results of such computations for some area functions published 
by Fant’ are shown in Figs. 3 through 9. 

We close this section by noting that if the experimental data is in 
the form of a driving point impulse response, then the simplest procedure 
is to use the method of Section IV [that is, to solve equation (83) for 
various values of a]. We have not computed area functions by this 
method so far, but propose to do so, using impedance tube or other 
experimental data. The limitations due to the inapplicability of the 
horn equation at high frequencies apply to this method as well. The 
effect of low-pass filtering the driving point response is being inves- 
tigated. 


VI. CONCLUSIONS AND DISCUSSION 


The comparison between measured and computed area functions 
of Figs. 3 through 9, indicates that knowledge of the first few (A, a) 
pairs is sufficient to get reasonable estimates of the area function. 
The X’s may be obtained directly from the speech output, since they 
can be computed with reasonable accuracy from the formant frequencies. 
The a’s on the other hand cannot be computed directly from the speech 
waveform, and impedance tube or other equivalent measurement would 
appear to be necessary. However, the vocal tract has physical con- 
straints which might be reflected in a functional dependence of the 
a’s on the d’s. The possibility of such functional dependence is being 


(Ao)? : (A) 


ky « h 


Fig. 2—Analog computer circuit for computing A. 
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(x) —> 


_ Fig. 3—Area functions reconstructed from the first three poles and residues of 
input impedance by the method of Section V using a uniform canonical tube. 
Dashed curves are the X-ray measurements. 


investigated. The sensitivity of the computed area functions to changes 
in the a’s is also being investigated. 

If one is willing to make acoustical measurements at the lips, then 
the method of Section IV is the most direct way of computing the area 
function. It has the added advantage that the length of the vocal tract 
need not be known. Some preliminary results on the effect of band- 
limiting the impulse response have been obtained and will be reported 
in a later paper. 


APPENDIX A 


Proof of Lemma 1 


Under the assumptions of this lemma, equation (6) may be trans- 
formed to: 


{[A@ Fo, 9} + VTA@)e@, ») = (AMI "e@» — (39) 
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except at the points x,, --- x, where A(x) is discontinuous. With 
% = O and x4; = m, equation (89) gives 


[A@) Fe, ») 


= foe) + [AE —O ITE AMP ol, ») at, 
Ce SPS Bee A= OA Gk (40) 


where 
f.(z, \) = a,(d) cos Aw + B,(A) sin deo. 


The coefficients a;(A), b;(A) are to be determined so as to make ¢(z, \) 
and A(x)¢’(2, \) everywhere continuous. (The conditions at zx = 0 
give do(A) = 1, bo(A) = 0.) Clearly for every » there exists a bound 
m(d) = sup | [A(z)}*o(a, d)|, 2; S @ S 2;4,. Then from equation (40), 


A(x) —> 





(2) —> 


Fig. 4—Same as in Fig. 3, except the canonical tube was chosen with a dis- 
continuity at the epiglottis. 
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z dyer 
m0) $ 1fe,» |+2m0) J Jsinage — 9 | | LAr" 


[ai(d) + BFA)]}t + Mm.r)/r (41) 


lA 


dt 





lA 


where J is a bound on the integral for all 7. Thus for \ > 2, 
| [A@) Fe, d) | S 2laia) + BAYH 
4 2y:(A). 


(42) 


CROSS-SECTIONAL AREA 





DISTANCE FROM GLOTTIS 


Figs. 5-9—Area functions reconstructed from the first four poles and residues: 
(a) the reconstruction using a uniform canonical tube, and (b) the reconstruction 
with a discontinuous canonical tube as in Section V. Dashed curves are X-ray 
measurements. 
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Then from equation (40), 
1 vi(A)e:(x) 
[A(x)]’o(z, d) = Tie dr) + x ’ Xi, < U2 < V1 (43) 


where c,(x) is bounded. Differentiating equation (40) and using a 
similar argument gives, 
OT 20M) _ LEN _ 1.0) dr, 


x; < x < p41 (44) 


nN Ox nN Ox 
with d,;() bounded. Defining 
k, = laaie aka 0 
0 [A(@wi—)/A(a:+)} 


and 
COS AX; sin Az, 
R; = ss 
sin Aw; —COS Az; 


the continuity conditions at x; give: 
nal 2 Rac) 
b;(A) b;-1(A) 
+ 730) pa oo] — 7) nf (45) 


d;—1(2;) d;(x;) 


Since the norm of R; is unity and of k; finite it follows upon taking 
the lengths of the vectors on either side of equation (45) that for large 
enough A, 7.(A) S K’y;_-1(A), for some finite constant K’. Since yo(A) = 1, 
it follows that y;(A) is bounded for all 7, as \ — o. Then from equation 


(45) 
| = ne] + 03). (46) 


b; bj-1 


However, if a; , b; satisfy equation (46), then f;(x, 4) = [A,(x)]'W(x, ») + 
O(1/\) forx; S v S x,,,, therefore from equation (43) 


[AQ Hola) = (Aol) ya) + © (47) 


with c(#) bounded. 
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APPENDIX B 


Existence of the Linear Transformation of Equation (12) 


In this appendix we prove the existence of the linear transformation 
of equation (12). Consider first the region 0 S x S 2,, where 2 is 
the first point of discontinuity of A(x). Then with y(x) = [A(z)]*6(a, 2) 
and g(x) = {[A(x)]}*}’/[A(z)]', equation (6) becomes 


y'"(x) = —y(x) + a@)y@). (48) 
Consider the function 
sa) = [ He, yl) at. (49) 
Then - 
a") = 2 H(, a)y(o)] 


weet t) yd | 





* PHA (x, t) 
+f SS wa. 60) 
After integrating twice by parts we arrive at the identity 


i. OHO yy at = AED wy | — He, y Y 


+ — + 


t=2z 








t=0 


xz & 
+ | He, ) hat. (61) 


Substituting for d’y/dt’? from equation (48) into equation (51) and 
adding equation (50) we get 


wa) rin = [ [PEED PHD + anaee, | 


-y(t) dt + 2y(a) © H(a, 2) 


oH He. 8 t) dy (t) 


yO 71 


If now H(z, t) satisfies the differential equation (13) with boundary 
conditions (14) and (15), then equation (52) shows that (x) + y(z) 
is some linear combination of cos \xv and sin Ax. Matching of boundary 
conditions at « = 0 then shows that 


(a) + y(z) = cos dx = [A,(x)PY(@,d), OS eS 2. (58) 








e + H(a, 2) (52) 
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The proof may be extended to x > x, by a similar procedure. Thus, 
for example, in the range 7, S$ x S 2x, H(z, t) must satisfy the dif- 
ferential equation (13), the boundary conditions (14) and (15) for 
x > x, , and boundary conditions at x = x, imposed by the continuity 
requirements on $(z, A), W(x, A), A(x)’ (a, A) and Ao(x)y¥/(z, 2). 


APPENDIX C 


Spectral Function from Two Sets of Eigenvalues 


We outline a method of getting a spectral function from two sets of 
eigenvalues. Let ¢(x, \) be the solution of equation (6) such that 


g(r, A) =a, Alm)y’(r, dr) = B (54) 
for every \. Let ¥(z, d) be the solution such that 
Wr, +) = 7, Alm)p'(m, rd) = 6. (55) 


Let \?, 42, «++ be the values of \” for which ay(0, d) + bA(0)¢'(0, d) 
= 0 and let w?,u2, °°: be the values of py’ for which ay(0, ») + 
bA(0)¥/(0, ») = 0. Let 


ay(0, d) + bAO)Y’O, d) | (56) 


mr) = 75(0, ) + DAO YO, ») 


Then the zeroes of m(A) are u1, us, °** and the poles are \;, 2, °°. 
If X(z, ) is any solution of equation (6), then it is easily shown that 


a? = xD [ A@XG, Nolo, r9) dz 


= A(x)[X(x, Ne"(x, ») — e@, WX"a, NI. 7) 


Choosing X(z, \) = (x, 4) — m()¢e(@, d) in equation (57) and using 
the boundary conditions on ¥(z, d) and ¢(z, A), we get 


OF = 29 | AGE,» — mo, Nl de = By— ad 68) 
for all \. ASA — A, 
ai = [° A@e"@, d.) de = lim @5 — By/I0 — Mme). 69) 


Thus, given A, , \2, and p; , wz , one obtains m(A) and hence a, , az, °° 
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APPENDIX D 


Derivation of Integral Equation (33) 


We give here a derivation of the integral equation (83) based upon 
the results of Section III. For simplicity we will assume that A(z) 
[and hence A,(x)] has no discontinuities. Then from equation (23) 


(I + H)A*(x) = u(z) (60) 
where u(x) is equal to 1 for all x > 0. Thus if f(z) is a function such that 
[I + H)“'u](x) = g(z) (61) 
then 
[ g(x) dx = ‘a A(x) dz. (62) 
Notice that if 
I+HA)\l+A)*=I4+K (63) 
then 
[ A@ ax = [+ Ky U@) ae 
a i, “GG de (64) 
where f(x) satisfies the equation 
[(Z + K)fl@) = u(x). (65) 


The kernel of J + K is recognized as that of equation (31) with a = z. 
Equation (33) therefore follows (for a = 7) from the symmetrization 
of f(a), exactly as in Section IV. However, the argument given here 
is independent of the length 7, which may be replaced by a. 

Using equations (60), (63) and (65), we have 


(I + H)*f(-) = [AGF (66) 
Therefore from equation (17) 


f(a) = [A@. (67) 
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Analysis of a Thin Circular 


Loop Antenna Over a Homogeneous Earth 


By 8. C. MOORTHY 
(Manuscript received March 31, 1969) 


In this paper, the current distribution on a bare conducting loop, sit- 
uated in free space over a semi-infinite medium, ts obtained for arbitrary 
time harmonic excitations. The loop 1s assumed to be thin, perfectly con- 
ducting and the standard one-dimensional integral equation and its Fourier 
series solution are used as the starting points. The field due to the current 
in the loop, where the semi-infinite medium is absent, is expressed as a 
superposition of plane waves. The tangential component of the field reflected 
by the interface, of the semi-infinite medium, ts evaluated using appropriate 
Fresnel reflection coefficients. This reflected field serves as a new source 
for the loop and induces a current on the loop. The field due to the induced 
current is treated in the same manner, and this process is repeated indef- 
initely. The summation of the original current and all the induced currents 
gives the steady-state current on the loop. 


I. INTRODUCTION 


It is well known that a high altitude nuclear burst generates an 
intense electromagnetic transient which covers a large geographical 
area. This transient field induces currents in communication circuits 
and, if these are large enough, adversely affects communication channels. 
One problem of particular interest in land-line communication is the 
coupling to large loops formed by cables. 

The loops formed by cables deployed in practical communication 
systems are very complex and cannot be analyzed easily. Typically, 
they run for many miles over inhomogeneous terrain and contain many 
junction points; nevertheless a great deal of insight into the behavior 
of these irregular loops can be obtained by studying the behavior of 
a large regular loop over a homogeneous ground. In this paper, the 
theoretical foundations for an analysis of a circular loop over a homo- 
geneous ground are developed, for time harmonic excitations. The 
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response of the loop for transient fields may be obtained by standard 
Fourier transform techniques. 

Various problems related to the thin circular loop have been con- 
sidered by numerous authors. These may be broadly classified under 
two categories: loop in an infinite homogeneous medium and loop in 
a stratified medium. 

In the first category, Pocklington, Oseen, Hallen, Storer, and Wu 
have analyzed the problem of a bare thin perfectly conducting circular 
loop in free space.”~° All these authors use the Fourier series expansion 
to solve the integral equation for the current in the loop. A good analysis 
of the problem is given in Wu’s paper. Adachi and Mushiake analyze 
the same problem by solving the integrodifferential equation for the 
current using an iterative method.’’* Mei, Baghdasarian and Angelakos, 
and Tang have discussed the direct numerical solution of the integral 
equation.’ A variational approach for determining the scattering 
cross section of a loop is given by Kouyoumjian.”” Problems concerning 
loaded loops are considered by Iizuka, Harrington and Ryerson, and 
Harrington and Mautz.’°-’* The analysis of a loop in a conducting 
medium is an extension of the analysis of a loop in free space and lends 
itself to certain approximations. Kraichman, Chen and King and 
King, and Harrison and Tingley have discussed the bare loop in a dis- 
sipative medium; Galejs has discussed an insulated loop in a dis- 
sipative medium.” Finally, the solution to the problem of two identical 
coaxial coupled loops in a homogeneous medium has been solved by 
Iizuka, King and Harrison.” 

In the second category the literature is mostly on small loops or 
magnetic dipoles over different types of media and is very extensive. 
(See Ref. 21 for an extensive bibliography.) Wait has considered the 
problem of loops over a homogeneous earth;”’”* recently, Sinha and 
Bhattacharya have analyzed the problem of a vertical magnetic dipole 
buried inside the earth.” 

The treatment of a small current carrying loop as a magnetic dipole, 
while satisfactory for many purposes, is nevertheless inexact. Moreover, 
we do encounter situations where the loop diameter is comparable to 
the wavelength of the excitation frequency and here we cannot assume 
the current to be uniformly distributed on the loop. A typical example 
would be the excitation of a loop by a narrow electromagnetic pulse 
which contains a broad spectrum of frequencies. The purpose of this 
paper is to solve the problem of a bare loop over a homogeneous earth 
taking into account the current distribution on the loop. 

Specifically, the system under consideration is a bare, thin, perfectly 
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conducting circular loop of mean radius b, formed by bending a cy- 
lindrical wire of radius a, situated in free space with its plane parallel 
to and at a distance d from the interface of a semi-infinite, linear homo- 
geneous, isotropic medium (Fig. 1). The loop is excited by an electro- 
magnetic wave of harmonic (exp jwé) time variation (the slice generator 
used to compute the admittance being a limiting case). It is assumed 
that k,a «< 1 and a < B, so that, if the loop were situated in a homo- 
geneous medium the current distribution induced by a specified time 
harmonic excitation is given by the so-called one-dimensional integral 
equation.” In addition it is assumed that d > a. 

It is desired to determine the current distribution [(¢) on the loop 
in the aforementioned system. This is accomplished in the following 
three, more or less self-contained, sections. In Section II the field of a 
circular filamentary current in free space is expressed as a superposition 
of plane waves. Section III evaluates the reflected field when an arbitrary 
field, of the general form obtained in Section II, is incident on the 
interface between free space and the semi-infinite medium. In Section 
IV the results of Sections II and JII are combined with the integral 
equation for the current on a loop in free space to determine the steady 
state current [(¢) on the loop using a recurrent ‘‘reflection-induction” 
scheme. 


Ii. THE ELECTROMAGNETIC FIELD OF A CIRCULAR FILAMENTARY CURRENT 


Consider a circular filament of current I[@) (Fig. 2a) of radius b 
situated in free space. The coordinate system is so chosen that the 
loop is parallel to the zy plane at a distance d from it. The loop current 
may be expressed as a surface current density K, in the z = d plane, 
in the following manner. 


K = asl (¢)d(p — b)d(z — d) (1) 


where p, ¢ and z are the cylindrical coordinates and a, the unit vector 
in the ¢ direction. The electromagnetic field due to K may be expressed 
as a superposition of plane waves as follows.” 


H® = [f (4p, £0, (P + m@ — Fm 


-exp {jk [le + my = (1 — 2? — m?)*e — d)]} dl dm, 


22d. (2) 
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Fig. 1—A thin circular loop over a homogeneous semi-infinite medium. 


EO =», ff {limp +a — Hea — F my 


— [Gd — m)P + ImQ\d — P — m’)*, £(1Q — mP)} 


-exp {jk [le + my F (1 — 2 — mi — dj} dl dm, 
z2Zd, (3) 





I (9) 





I (9) 










Z>O 
FREE SPACE FREE SPACE 
€o, Ho 





(a) (b) 


Fig. 2—(a) A filamentary loop current in free space, (b) A filamentary loop current 
over a homogeneous semi-infinite medium. 
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where 


PUL, m) = 02/80") [[ KuCe, a) exp [—iklle + my] ae dy, () 


Q(1, m) = ~(82/8n") ff” K.Ce, w) exp [she + my) de dy, ©) 


ke = we, and 4. = (u./e,)?. (6) 


The quantities 1 and m are in general complex and the integrals in 
equations (2) and (8) are contour integrals, the choice of contours 
being dictated by physical considerations. A possible choice of 1 and m 
is defined by the following transformation 


l = 7 cos | O<7, w being complex. (7) 
m= 7 sin y, 
Let 
I$) = eB I, exp (jn¢). (8) 


Substitution of equations (1), (7) and (8) into equations (4) and (5), 
and changing from rectangular to cylindrical coordinates yields the 
following equations 


+0 


P(r, ¥) = (k¢b/87) Ee (—3)" exp (inp) Jn(hobt)Tn-1 + Invx), 9) 


—* 


Ql, ¥) = —Ib/8=) Yo (—H" exp GnvIulbsb) Ter — Inv), (10 


where J, denotes the Bessel function of order n. 


Ill. CIRCULAR FILAMENTARY CURRENT OVER A SEMI-INFINITE MEDIUM 


Here again we consider the filamentary loop of Section II, but instead 
of being situated in free space it is situated over the homogeneous 
semi-infinite medium z < 0 (Fig. 2b). The total field in the region 
z = 0 consists of the primary field E®’, H™ of the filamentary current 
[equations (2) and (3)] and the field E“’, H‘” reflected by the interface 
z = 0. We proceed as follows to evaluate the latter. Let 


H(z, y,2) = I H,(1, m) exp [—jk.n.(l, m)-r] dldm, 2d, 


(11) 
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EM (e,y,2) = ff EC, m) exp [—ikan,(l, m)-t]dldm, 2 Sd, 


(12) 
where . 
H,(1, m) = [—a.P — a,Q + aIP + mQ)( — P — m4] 
-exp [—jk,d(1 — ? — m’)*], (13) 
E,(1, m) = y.{a.[ImP + (1 — P)Q\ — P — my? 
+ a,[(m? — 1)P — ImQ]\(l — P — m*)? + a,(mP — 1Q)} 


-exp [—jk,d(l — ? — m’)*, (14) 
n,(l, m) = —a,l — aym — a,(1 — P? — m)}, (15) 

and 
r=a,7 +a yy + a,z. (16) 


Each one of the constituent plane waves propagates in the “‘direction”’ 
n,(J, m). Let R,(l, m) and R,(l, m) represent the Fresnel reflection 
coefficients for the cases where the incident electric field is perpendicular 
and parallel respectively to the plane of incidence. Evaluation of the 
reflected field is achieved by resolving each plane wave into components 
with the electric field perpendicular and parallel to the plane of incidence. 
To this end we define a local coordinate system’ as shown in Fig. 3. 

The plane of incidence is defined by the unit vectors a, and n,(l, m). 
Let 

a, = (a, x n,)/[(1 = (a, n,) J’, (17) 
a =a, Xa,. 


Then a, , a; and a, form a right-handed coordinate system, a, is normal 
to the plane of incidence and a, and a; lie in the plane of incidence. 
In terms of a, and a, we have 


a, = (0° a m’)*(ma, a la,), (18) 
a, = (0 + m’)*(la, + ma,). (19) 


t The propagation vector no becomes complex for certain values of 1 and m, the 
associated plane wave being inhomogenous. When this happens some of the terms 
used in the analysis (for example, plane of incidence, coordinate system, normal, 
and so on) become inaccurate and should be interpreted in a generalized sense. 
The results obtained are quite general and valid. 
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Fig. 3—Local coordinate system. 
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The field vectors H, and E, are now resolved into components per- 
pendicular (H,, , E,,) and parallel (H,, , E.,) to the plane of incidence 


as follows 
H, = H., + H,,, 
where 
H,. = a,(P + m’)*(1Q — mP) exp [—jk.d(1 — ? — m’)4, 
Ho = (a. — P — m)* — a,(P + m) (IP + mQ) 
-exp [—jkd(1 — ? — m4], 
EB, = Bou Bay 5 
where 
Eo. = an? + m)*(L — P — m*)*(P + mQ) 
exp [—jk.d(1 — ? — m’)}], 
E,, = nla, — a(l? + m) 30 — 2? — m?)*\(mP — 1Q) 


-exp [—jk,d(1 — 2? — m’)*]. 


It may be easily verified that 
H,) = nM, X E,1), 
Ea = —7o(N x H,.). 


(20) 


(21) 


(22) 


(23) 


(24) 


(25) 


(26) 
(27) 
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The reflected field corresponding to the incident field defined in 
equations (11) and (12) may be represented as 


A Gaya / [ Hida eis te Cwldldn 220, 


E(x, y,2) = iP E,(1, m) exp [—jk,.n.(l, m)-r] dldm, z= 0, 


(29) 
where 
n(l,m) = —a,l — aym + (1 — 7? — m)4a,. (30) 
Let 
H, = Ho, + Hey, (31) 
E, = Eo, + Ex. (32) 
According to Fresnel’s laws 
Ex. = Rill, mE. , (38) 
Hoi = Ry, mH. , (34) 


where 
Ri(l, m) = {uk,(l — P — m?)! ~ w[k? — KP + m’)}*}/ 
fuk (1 — ? — my! + aw fk? — bP + m’)]}}, (35) 
R,(l, m) = {u,k?(1 — P — m?) — pk lk? — bP + m?)]*}/ 
{uk*(1 — P — m’)? + klk? — WP + m3}, (36) 


k? = —jou(o + joe). (37) 

We also have the relations 
Hoy = 7, (2 X Eni), (38) 
E,, = —7(M2 X Hz,). (39) 


Substitution of equations (33), (34), (38) and (89) into equations (31) 
and (82) yields 


E, 7 RE, aoe nolty (Ne x H,.), (40) 
H, = R&A + no iM, X E,1). (41) 
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Equations (40), (41), (28) and (29) specify the reflected field com- 
pletely. The ¢ component of the electric field is of special interest since 
it induces a current in an actual circular loop. Taking the ¢ component 
of equation (40), using polar coordinates for x, y, and equation (7), 
we obtain 


Ex, = 9{Ry(r)(1 — 7°)*(P sin ¥ — Q cos ¥) sin (p — 4) 
— Rin — 7)(P cos y + Qsin y) cos (y — 9)] 
-exp [—jk,d(1 — 7°)?], (42) 
where 


R(7) = [u.k(1 = 7’)? —_ pk (k? —_ ke7?)3]/ 


[uok?(1 — 7°)? + pki (k? — k2r”)?], (43) 
Ri(r) = [uk,(l — 7°)! — p(k? — ker)*]/ 
[wk(1 — 7°)? + poo(k? — k27?)*]. (44) 


P(r, v), Q(7, W) are defined in equations (9) and (10), and after re- 
arrangement yield the following equations 


P cosy + Qsin py = j(4r) "kb > IJIn(kobr) exp | in(v - =) | 


(45) 
Psin y — Qecosy = (4rr) ‘nk, > TJ a(kobr) exp | in(v — *) | ’ 
(46) 


where the prime denotes differentiation with respect to the argument. 
The ¢ component of the reflected field is contained in equation (29) 
and is explicitly given by 


Es" (p, 6,2) = i | Boslr, v) 


-exp [jk,p7 cos (W — $) — jk,2zd1 — r’)*] dyrdr. (47) 


The contour of integration C in the complex y plane is to be chosen 
from physical considerations. An examination of equations (42), (45) 
and (46) shows that the y integrals J, , Jz are of the form 


ja = [2 exp | in(y - z) + jkepr cos (p — | dy. (48) 


cos 
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Since J,,2 are solutions of Helmholtz equation in cylindrical coordinate 
system, we expect them to be cylindrical functions (compare with Ref. 
26, pp. 367-368). The requirement that they be bounded when the 
argument of the Bessel functions approach zero determines that they 
are the ordinary J functions. Thus we obtain, 


g: = 2rntk.pt) 'J.(k.p7) exp (jn), (49) 
Jz = —J2aJi(k,pr) exp (yng). (50) 


Substitution of equations (42), (45), (46), (48), (49) and (50) into 
equation (47) yields the following expression for the ¢ component of 
the reflected field | 


+0 


H$(p, $2) = 2m Ys Lzs(o,2) exp (ind), (61) 
where 
Zn(p, z) = (4a) *kobn, [ [n°(k2pb) "JS .(kopt) J n(kobr)r ‘(1 ae 7°)?R,(r) 
«0 


— Si(k,br)Ji(kept)t(l — 7°) *Ri(7)] 
-exp [—jk.(e + d(1 — 7°)*] dr. (52) 


In particular, 
2u(b, d) = (Ae) *Rbn, | En2 (hsb) 73 (kaba)e AL — 2*)'R,(2) 
0 


— JP(k,br)rd — 7) ?R,(7)] exp [—j2k,d(l — 7)? ] dr. 
(53) 


IV. CONDUCTING CIRCULAR LOOP OVER A SEMI-INFINITE MEDIUM 


In this section we consider a perfectly conducting thin circular loop 
of mean radius b situated over the semi-infinite medium (Fig. 1) with 
its plane parallel to the interface. The radius of the wire forming the 
loop is a and the height of the loop above the interface is d. 


4.1 Current Distribution 


Let E%(¢) be the ¢ component of the applied electric field, I“ ¢) 
the current distribution that would be created by Ej (@) on the loop 
if it were situated in free space and [(¢) the current distribution caused 
by £3” @) when the loop is situated as shown in Fig. 1. Typical examples 
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of Hi @) are the slice generator (used in admittance computations) 
and the ¢ component of the electric field that would exist at the loop 
location in the absence of the loop (as in scattering problems). 

The relation between the applied tangential electric field and the 
currents induced on a loop takes the form of coupled integral equations 
which are extremely difficult to solve.” However if the loop is “thin” 
(a <b, a<) the current distributions on the loop is given accurately 
by the so-called ‘one-dimensional integral equation.”” Thus we have 


m9) = [ o@ - 61°@ as, (54) 


where 


G(x) = i(dn)"*n bad cos x + (k,b)™ Flea 


i . {R(z) exp [—JkR(2)]}ase sine 48, (55) 


R(x) = b[4 sin? (2/2) + (a7/d?)}?. (56) 


A formal solution to equation (54) is obtained by using Fourier series 
representations as follows. Let 


+0 


1() = YS 12 exp (nd), (57) 
BS) = Yad exp (nd), (58) 
Ge — 61) = D 8, exp Lin — 6°01, (69) 
where 
12 = @ay J” 1@) exp (—ind) a, (60) 
as? = @ayt [ S°@) exp (—ing) de, 61) 
B= (ay? f " G(@) exp (—jnz) de. (62) 


Substitution of equation (57), (58) and (59) into equation (54) yields 
the following relation between the Fourier coefficients 


I,” = (2) "(on” /Bn).- (63) 
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The current distribution is given by 


+0 


Tg) = (2x) Py (a,”"/Bn) exp (jng). (64) 


Let L{? (6) be the tangential electric field, at the loop, of the reflected 
field whose incident field is caused by I‘ (¢) and let I“ @) be the 
current distribution on the loop caused by HS” ¢). Let 


E3°@ = a,” exp (jn), (65) 
IM%@) = Tx? exp (jnd). (66) 

Then 
I) = (Qa)7"[ak? /Bal (67) 


[compare with equation (63)]. Also from equations (51) and (65) we 
obtain 


a = QrIz (6, d). (68) 
Hence 
TST Os; DO / Bali (69) 


In general, if J‘ denotes the nth Fourier coefficient of current distribu- 
tion I” (¢), induced by the kth reflected field we have 


Le Ee 6; d)/Bnl, k= I, 2, “ete (70) 
Let 
16) = XS I, exp (jnd). (71) 
Then 
n= DE (72) 
k=0 
that is, 


=O FIP +I +e, 
= I," {1 + [2n(0, d)/Bal + [en(b, @)/Bal” + +++}; 
= I,° {1 — [z,(b, d)/8,]}"", provided | [z,(b, d)/B,] |< 1. (78) 
Henceforth, we assume that | z,(b, d)/@,| < 1. Substituting for If” 


LOOP OVER EARTH 1227 


in equation (73) from equation (63) we obtain 


I, = (2m)*a,” [B, 21D; d)\", (74) 


I@) = (2r)" 2, a,” [Bn — 2n(b, d)}"* exp jnd. (75) 


The following expression for 6, 1s obtained by simplifying equation 
(62): 


B, = j(4rk,b’)'9,(2/z) 


| (2.06.5) Moas(a) + Mar(a)] — 12M (0)}eoeeine d8, (76) 





where 
1 a /2 F 
M,(2) = . (sin” @ + 2° /4b°)~? 
“0 
-exp [—j2k,b(sin’ @ + 2°/4b’)*] cos (2n6) dé. — (77) 
Let 
w/2 
K, = (2/n) / M,(a) d6. (78) 
0 a—2a sin @ 
Then 
B, = j(4ak,b°)"n.[$(kob) (Kanai + Ka) — 1K]. (79) 


4.2 Input Admittance 


Let the primary source be a slice generator (delta function source) 
of voltage V located at @¢ = 0. That is 


Ey? @) = [V6@)/d}. (80) 
Substituting equation (80) into equation (61), we obtain 
a? = (V/2rb). (81) 


Substitution of equation (81) into equation (75) yields 


1(6) = Vids") 1B, — (0, I" exp (ind). (82) 


=~00 


The admittance Y at the input terminals at ¢ = 0 is given by 


Y = 1(0)/V = (4n°6)" Do (8, — (0, DT. (33) 
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The use of the delta function generator will give rise to an infinite 
input admittance’ so that the series in equation (83) is divergent. 
However this difficulty is overcome by computing the difference be- 
tween the admittances of two loops of different radii.” 


V. SPECIAL CASES 


5.1 The Magnetic Dipole Over a Semi-Infinite Medium 


When the radius, b, of the loop becomes very small compared to the 
wavelength (that is, k,b «< 1) the current distribution on the loop 
becomes uniform. This enables us to retain only the zeroth terms in 
the infinite series representing the different quantities of interest. The 
field of a dipole over ground is well discussed in the literature and will 
not be considered here. The input impedance of a dipole over a semi- 
infinite medium is of considerable interest and may be obtained from 
equation (83). Thus we obtain 


Zin = 4n°b[8. — 2,(b, d)]. (84) 


The term 47°b8, represents the input impedance of the loop in the 
absence of the semi-infinite medium and the term —47”bz, represents 
the contribution of the semi-infinite medium. That is 


fig SS Lge Ligases (85) 
where 
Zoi = An’ dB, , (86) 
Zeee = —4n°bz,(b, d) 


(bob) ne ( ” Pkebt)r(L — RA) 


-exp [—j2k,d(1 — r)*] dr. — (87) 
5.2 Thin Circular Loop Over a Perfectly Conducting Plane 
Let 
2*(b, d) = limz,(0, d). (88) 


When co — - the reflection coefficients simplify to 
R,(r) = +1, R,i(r) = —l. (89) 


Substituting equation (89) into equation (53), we obtain 
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2n(b, d) = (4m) "hobn, 
/ [n?(k.b) ?Ia(kobr)7 (1 — 2°)? + JP(k,b7)r(l — 7°) 4] 
0 


-exp [—j2k,d(1 — 7°)*] dr. (90) 


The integral in equation (90) may be simplified, by making use of the 
properties of Bessel functions, and yields 


z&(b, d) = (4b) *nld(hob)*(Sn-1 + Sua) — 2S, (91) 
where 


c= fr — 7) 20,b2) exp [—i2k.dl — dr. (02) 
0 
An alternate expression for ¢, is obtained as follows: 


Pier= i | F (Ok bcsin By08 CRO) 20: (93) 
0 


substituting equation (93) into equation (92) and changing the order of 
integration we obtain 


Sa = j(kob) *M,(2d), (94) 
where M(x) is defined by equation (77). Therefore, 
z*(b, d) = j(4rk,b”)™’ 
“nol (3) (hob) "[Mn+1(2d) + M,-1(2d)] — n°M,(2d)}. (95) 


A comparison of equations (95) and (76) reveals a strong similarity 
between the expressions for 8, and 2*(b, d). The input admittance of 
a thin circular loop over a perfect ground plane is given by 


+00 


Y = (4n°b)* Do (8, — 28(b, dy”. (96) 

The above formula agrees, with that derived by Iizuka and others,” 
for the input admittance Y™ of a loop in the presence of an identical 
coaxial loop carrying a current distribution which has an opposite 
phase. They use the simpler Kernel given by Storer’ to computer 8, , 
a procedure satisfactory for small loops. However they make use of the 
similarity between 6, and z*(b, d) to compute the latter using the 
approximate expressions given by Storer. This procedure will yield 
erroneous results for large separation, d, for the following reason. 
The approximate expressions for 6, given by Storer, or for that 
matter Wu, are valid for k,a «< 1. The corresponding condition to be 
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imposed in the evaluation of 2*(b, d) is, (2k,d) « 1. Thus it is seen that 
the approximate expressions of Storer give accurate values of z*(b, d) 
only for very small separations. 


VI. NUMERICAL COMPUTATIONS 


Equation (76) was further analyzed and approximate expressions 
for the 8, coefficients were derived in terms of Bessel and Legendre 
functions. The integral defining the z, coefficients could not be expressed 
in terms of known functions and so was evaluated by numerical integra- 
tion. However, in the evaluation of 2* it was possible to use some of 
the formulae developed for 8, for small values of d [compare with 
equation (95)]. The numerical integration was carried out by using 
the Romberg integration scheme. All the computations were done by 
FORTRAN programs on a GE-635 computer. 

Figure 4 shows the variation of the input admittance of a loop, 
when it is in free space, over moist earth and over an infinitely con- 


B IN MILLIMHOS 


G IN MILLIMHOS 





Fig. 4—Input admittance of a circular loop, over different media, as a function of 
frequency (d/b = 0.25; a/b = 0.002). G—conductance; B—susceptance. I—loop in 
free space; II—loop over moist earth; III—loop over perfect ground (¢ > @). 
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ductive ground plane, as a function of frequency. The values of the 
various parameters used in these calculations are 2rb = 30 meters, 
f = 5 MHz to 13 MHz, d/b = 0.25, a/2b = 0.001, ¢ = 5 millimhos/ 
meter, and ¢/é9 = 15. The last two parameters characterize the moist 
earth. The frequency range was deliberately chosen so that the moist - 
earth cannot be approximated either as a highly conductive medium (low 
frequency approximation) or as a lossless dielectric (high frequency 
approximation). 

The real part G of the input admittance shows its characteristic peak 
near k,b = 1 (these occur for values of k,b near 1, 2, 8, ---) but the 
exact location of the peak as well as its magnitude depends on the 
medium below the loop. When the loop is located above a highly con- 
ducting ground plane the resonance is particularly sharp at k,b = 1 
since at this frequency the loop and its image are exactly half wavelength 
apart. The imaginary part B of the input admittance changes from 
inductive to capacitive near k,b = 0.7 and back to inductive near 
kb = 1. Here again the transition at k,b = 1 for the loop over a highly 
conducting ground plane is almost discontinuous. 

Figure 5 shows the variation of the input admittance of a loop over 
a highly conductive ground plane as a function of the distance between 
the loop and the ground plane. The curves are plotted for k,b = 1, 
a/2b = 0.001 and d/b ranging from 0.25 to 5.0. It is observed that as d/b 
increases, the input admittance approaches the free space value in an 
oscillatory manner. 

The aforementioned calculations are presented only as examples 
of the different types of investigations that may be carried out based 
on the theory developed. The computer programs developed in this 
connection are very general and may be used for computations of loops 
as large as k,b = 10. 


VII. SUMMARY 


The problem of a thin, perfectly conducting, circular loop situated 
in free space over a semi-infinite homogeneous isotropic medium was 
solved. Iixpressions for the current distribution on the loop caused by 
an arbitrary time harmonic source [equation (75)] and the input admit- 
tance [equation (83)] were derived. The results are applied to special 
cases to evaluate the input impedance of a vertical magnetic dipole 
over a semi-infinite medium [equation (84)] and the input admittance 
of a circular loop over a perfectly conducting ground plane [equation 
(96)]. Some numerical results are also given. The analysis for the general 
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Fig. 5—Input admittance of a circular loop over an infinitely conducting ground 
plane as a function of height (kob = 1, a/b = 0.002). G—conductance; B—suscept- 
ance. 


case of a circular loop in an arbitrary homogeneous medium (as opposed 
to free space assumed here) over another homogeneous medium can 
be easily done by modifying the parameters. 
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A Three-Radiometer Path-Diversity Experiment 


By R. W. WILSON 
(Manuscript received February 27, 1970) 


Data from the Crawford Hill sun tracker have confirmed the expecta- 
tion that frequencies above about 10 GHz experience large attenuations 
on an earth-space path when heavy rain is encountered.’ Tillotson has 
proposed using frequencies of about 20 and 30 GHz for a domestic 
communication satellite system.” In that case, the required continuity 
of service would be achieved by using two or more diversity ground 
stations. The three-radiometer path-diversity experiment discussed 
here was designed to work in conjunction with the sun tracker to deter- 
mine the advantage of diversity ground station pairs at these frequencies. 

A common method of measuring atmospheric attenuation on earth- 
space paths at microwave frequencies has been to measure the brightness 
temperature of the atmosphere. In the absence of scattering, one can 
obtain the attenuation in a path through the atmosphere from the 
relationship between the physical temperature (7), attenuation (A), 
and brightness temperature (T’). 


T, = (1 — 10 ae (1) 


At the frequencies of interest, however, scattering from raindrops is 
not negligible and, in addition, one can only estimate 7’. The sun tracker’ 
is set up to measure 7’, in a direction sbout 2.5° away from the sun and 
A in the direction of the sun simultaneously. Experience with these 
measurements has shown that by choosing the proper value of T (and 
one can do better still by making 7 a function of T,). Equation (1) and 
measurements of 7, will give dependable values of A up to about 12 dB 
(see Fig. 11 of Ref. 1). Thus, in the interest of simplicity, a 16 GHz 
diversity experiment has been set up using fixed-pointed radiometers 
rather than, say, additional sun trackers. Further advantages of this 
method are that all the measurements are made at the same elevation 
angle, and data are taken for 24 hours/day. A disadvantage is the limited 
measuring range (12 dB, which translates into about 36 dB at 30 GHz). 
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Further evidence of the ability of radiometers to measure attenuation 
is given in a companion paper.* 

The three radiometers are located on a northwest to southeast line, 
the middle one being at Crawford Hill, a few yards from the sun tracker. 
The northwest radiometer is in Sayreville, N. J., seven miles from Craw- 
ford Hill, and the southeast one, at the time these data were taken, was 
located at Exit 114 of the Garden State Parkway, two miles from Craw- 
ford Hill. The antennas are pointed at an azimuth of 226° (approxi- 
mately perpendicular to the line along which they are located) and 
at an elevation of 32°, near the center of the apparent motion of ATS-5.° 
The beam widths are 1.05° for the remote antennas and 0.65° for the 
Crawford Hill antenna. 

The RF and IF parts of the radiometers are very much like those of 
the 16 GHz receiver on the sun tracker.’ A ferrite switch, driven at 
10 Hz, switches the input of the down converter between the antenna 
and a reference termination whose temperature is stabilized at 30°C. 
At the output of the receiver two voltages are developed on 4 second 
time constants. One is proportional to the average receiver output during 
the times that the input of the receiver is connected to the antenna 
and the other to the output while the receiver is connected to the ref- 
erence termination. The level derived from the reference termination 
is used for an automatic gain control (AGC) loop and the difference 
of the two levels produces the main output. 

All three radiometer outputs are brought to the Crawford Hill 
Laboratory for recording. The remote radiometers have their outputs 
(essentially de levels) connected to voltage controlled oscillators (VCO) 
operating in the voice band. The VCO outputs are sent over telephone 
lines to Crawford Hill where discriminators convert the frequencies 
back to voltages for recording on strip charts.* 

The data discussed here are based on the period April 1 through 
August 7, 1969. Large but not perfect correlation among the attenua- 
tions is often observed for the three stations but there is usually suffi- 
cient time shift between the attenuations at the more widely separated 
stations to give a good diversity advantage at high attenuation. 

During the period that the data were being collected, it was noticed 
that attenuation at Sayreville (the most northerly station) often pre- 
ceded that at the other locations by a time large enough to prevent over- 
lap of high attenuations, but that the time difference between the events 
at Crawford Hill and the Parkway site (two-mile separation) was 
insufficient for much diversity advantage. 


* The data are now also being recorded digitally on magnetic tape. | 
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A statistical summary of the measurements taken for the two-, 
seven- and nine-mile spacings is shown in Fig. 1. What is plotted is the 
percentage of the measuring period (3080 hours) that the attenuation 
shown on the abscissa was exceeded. Data from the three radiometers 
are plotted individually in Curves (1), (2) and (38) and in combination 
as the three possible diversity pairs in Curves (4), (5) and (6). One 
sees explicitly that the seven- and nine-mile separations result in 
a large diversity advantage whereas the two-mile spacing results in 
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16 GHZ ATTENUATION IN DECIBELS 


Fig. 1—16 GHz radiometer rain data, April 1, 1969 through August 7, 1969; 
total time 3081 hrs. Distributions of attenuation for the individual stations: 
(1) x- - --x Sayreville; (2) o—— © Crawford Hill (3) o2—————0 Parkway. 
Distributions of attenuation for pairs of stations in diversity: (4) x- — - —-x Crawford 
Hill to Sayerville, 7 miles; (5) o—— © Parkway to Sayreville, 9 miles; (6) 
o—————-0 Parkway to Crawford Hill, 2 miles. 
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much less. For example, at the 12-dB level, the data from the diversity 
pairs separated by seven and nine miles show about a factor of 50 
improvement over the individual distributions. The difference between 
the curves near the bottom of Fig. 1 should not be taken as very sig- 
nificant since the data in those portions of the curves are from only a 
few rains. The attenuation scale for Fig. 1 is based on T' = 273° which 
is a good fit to 1968 sun tracker data. During the 129 days used for 
this data sample, 15.26” of rain fell at Crawford Hill. 

Thus, spacings of from five to ten miles provide considerable diversity 
advantage on earth-space paths at 16 GHz. The experiment is con- 
tinuing but the Parkway station has been moved from two miles to 
twelve miles southeast of Crawford Hill. This will make available 
diversity data for spacings of seven, twelve and nineteen miles. 
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First Result from 15.3-GHz Earth-Space Propagation Study 


By A. A. PENZIAS 
(Manuscript received February 27, 1970) 


Consideration is being given to the use of frequencies well above 10 
GHz for satellite communication. One of the problems encountered in 
the use of these high frequencies is the attenuation caused by precipita- 
tion. A considerable amount of pertinent data has been obtained in 
the 16- and 30-GHz sun tracker and 16-GHz radiometer programs under- 
way at the Crawford Hill Laboratory.’'? From the data obtained from 
radiometer measurements, one may calculate the attenuation expected 
on an earth-space path. Such attenuation calculations have been shown 
to be in good agreement with sun tracker measurements.’ The present 
experiment was designed to check further the validity of the radiometer 
results by a direct comparison of the data obtained from a 16-GHz 
radiometer with the attenuation measured by transmission from a 
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satellite at a nearby frequency. This experiment was made possible by 
the inclusion by NASA of two 200 MW, 15.3-GHz transmitters aboard 
the Fifth Applications Technology Satellite (ATS-5). 

The Fifth Applications Technology Satellite was launched into a 
synchronous orbit in early August 1969. It arrived at its station at 
lon 108° a month later. Owing to a partial malfunction during the second 
stage of the launching sequence, the satellite spins at a rate of about 
60 rpm. As a result signals are received from the satellite for only 40 
milliseconds out of each second, when the transmitting antenna momen- 
tarily points at the earth during each revolution. Although this reduces 
the average signal level by 14 dB, it is still possible to obtain a measuring 
range of approximately 20 dB with our receiving system. 

The receiving antenna is the 20-foot horn-reflector at Crawford Hill. 
Although originally designed for the 2.39-GHz Echo project, it was 
found to be quite satisfactory at 15 GHz. 

The experiment employs two separate receiving systems, the 15.3- 
GHz satellite receiver and a 16-GHz radiometer supplied by R. W. 
Wilson. They are mounted at the antenna throat to receive orthogonal 
polarizations, with their outputs recorded simultaneously on a two-pen 
chart recorder. 

The satellite receiver uses a conventional 15.3-GHz waveguide bal- 
anced mixer receiver with an IF amplifier 100-kHz wide at 70 MHz. 
Since signal phase information is not required, no phase locking to the 
signal is used; the local oscillator is merely stabilized at 15,230 MHz. 
The output of the IF detector is connected through a capacitor to a 
peak detector. This peak detector is sensitive only when the transmitted 
signal is “on”, hence more or less maintaining the signal to noise ratio 
expected without spinning. The discharge time of the circuit was de- 
signed to be ~4 sec in order to reduce the noise fluctuation level by 
“averaging” over about four pulses. 

Data were taken at our site for all rainstorms occurring during the 
time the satellite was transmitting in the three-month period following 
the initial ATS-5 transmitter turn-on on October 4, 1969. In only one 
of these storms (November 19 and 20, 1969) did the attenuation exceed 
3 dB. Data taken during this storm is shown in Fig. 1. It is a plot of 
measured attenuation of the received satellite signal versus attenuation 
computed from the 16-GHz radiometer record at corresponding times. 
The size of the bars indicate the estimated measurement errors. The 
dotted line corresponds to an attenuation ratio of 1.1 between 16 GHz 
and 15.8 GHz. (The attenuation due to rain increases approximately 
with the square of the frequency in this wavelength region.) A good 
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16 GHZ RADIOMETER ATTENUATION IN DECIBELS 
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Fig. 1—Data recorded during storm on November 19 and 20, 1969. 


correspondence between the radiometer and satellite data is evident 
in the figure. 

The long-term stability of the signal transmitted from the satellite 
was not good enough to permit absolute attenuation measurements 
over the duration of the rainstorm (about seven hours). Instead, at- 
tenuation was measured incrementally for short periods of heavy rain, 
each some few minutes in duration, which appeared as distinct ab- 
sorption features on the record. The attenuation points plotted in the 
figure are thus actually differences from ‘baselines’ drawn between 
fiducial points of low rain attenuation adjacent to each feature. A 
corresponding subtraction was also made in each case for the radiometer 
record. In no case did this correction exceed 0.3 dB at 16 GHz. 

In addition to the data from this storm, attenuation due to rain in 
excess of ~1 dB was noted on the radiometer record on a number of 
other occasions. In each case a corresponding dip in the received 
satellite signal was also recorded. Within the accuracy of this latter 
record, typically +-0.2 dB, definite correspondence of the records was 
obtained, indicating clearly that the radiometer data is a reliable measure 
of attenuation over the range of storm intensities measured so far. 
(Note added in proof: In the spring of 1970, a number of rainstorms 
were measured with results quite similar to those reported above. The 
most intense storm for which reliable data was obtained occurred during 
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the afternoon of April 24, 1970. The peak measured 15.3-GHz signal 
attenuation and the corresponding 16-GHz radiometer record were 
9.8 dB and 10.6 dB respectively.) 
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Intersymbol Interference and the P/AR Meter 


By J. H. FENNICK 


(Manuscript received February 11, 1970) 


P/AR stands for peak-to-average ratio. The instrumentation to 
measure this ratio has been assembled and obtains a single number 
evaluation to the dispersion in a transmission medium by measuring 
the peak and the full wave rectified average of a pulse stream.’”” 

The P/AR meter has been under exploratory study and use for 
several years now. Many empirical studies have been made relating 
the P/AR rating to data transmission performance in the presence of 
gain and phase distortions. This study was undertaken to demonstrate 
the relationship between P/AR and intersymbol interference in pulse 
transmission caused by deviations from linear phase of passive net- 
works. 

Appropriately weighted peak and rms intersymbol interference has 
been calculated for 19 passive all-pass networks. The results were 
correlated with the P/AR ratings of the networks to determine the 
relationship between P/AR and intersymbol interference due to phase 
distortion. The values of P/AR ranged from 30.86 to 97.58. The results 
show that P/AR has a correlation coefficient of —0.94 with rms inter- 
symbol interference and —0.98 with peak intersymbol interference. 

K. D. Sunde demonstrated in 1954 that rms and peak intersymbol 
interference in pulse transmission could be calculated by the following 
procedure.’ The departures from a linear phase characteristic may be 
represented by a Fourier series expansion of the form f(w) = 


1246 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1970 


>>, 6, sin nwt. The peak intersymbol interference, based upon small 
echo theory, then becomes I, = >, | b, | and the rms interference be- 
comes I, = [1/2 >> v2)”. 

The phase characteristics of 19 all-pass networks were measured every 
50 Hz over the frequency range 300 Hz to 3250 Hz. The departures 
from linearity were determined by subtracting the actual measured 
value at each frequency from that obtained by a linear least squares 
fit to these data. The resulting departures were then weighted by the 
P/AR spectrum and expanded in a Fourier series by means of the fast 
Fourier transform (FFT). The FFT generated the first 128 terms of the 
series, the first 80 of which were sufficiently accurate and significant 
to give a good estimate of J, and J,. The results were then correlated 
with the calculated P/AR values of the networks. 

Scatter diagrams of J, vs P/AR and J, vs P/AR are shown in Figs. 1 
and 2. The linear correlation coefficients of P/AR with I, and I, were 
—0.941 and —0.978. This study demonstrates the high correlation of 
the P/AR rating system with rms intersymbol interference and hence 
with eye openings on data sets, and places more confidence in the value 
of the P/AR system. 
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Fig. 1—Scatter diagram of I, vs P/AR. 
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Fig. 2—Scatter diagram of I, vs P/AR. 
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